Book, English, Volume 12, 80 pages, Format (W × H): 156 mm x 234 mm
Series: Foundations and Trends® in Information Retrieval
ISBN: 978-1-60198-322-0
Publisher: Now Publishers
This is a survey of the science and practice of web crawling. While at first glance web crawling may appear to be merely an application of breadth-first search, it in fact poses many challenges, ranging from systems concerns, such as managing very large data structures, to theoretical questions, such as how often to revisit evolving content sources. This survey outlines the fundamental challenges and describes the state-of-the-art models and solutions. It also highlights avenues for future work.
Further Information & Material
1: Introduction
2: Crawler Architecture
3: Crawl Ordering Problem
4: Batch Crawl Ordering
5: Incremental Crawl Ordering
6: Avoiding Problematic and Undesirable Content
7: Deep Web Crawling
8: Future Directions
References




