jd20|5 years ago
You should check out Manning's "Introduction to Information Retrieval". It has far more detail about web crawler architecture than I can write in a post, and it served as a blueprint for many of Applebot's early design decisions.
giu|5 years ago
The book is freely available online at https://nlp.stanford.edu/IR-book/information-retrieval-book....