(no title)
mikaraento | 6 months ago
Inverted indices were not used as they worked poorly for “an ordered list of words” (as opposed to a bag of words).
And this doesn’t even start to address the ranking part.
mikaraento | 6 months ago
Inverted indices were not used as they worked poorly for “an ordered list of words” (as opposed to a bag of words).
And this doesn’t even start to address the ranking part.
smokel|6 months ago
Wikipedia [1] links to "Jeff Dean's keynote at WSDM 2009" [2] which suggests that indices were most certainly used.
Then again, I am no expert in this field, so if you could share more details, I'd love to hear more about it.
[1] https://en.wikipedia.org/wiki/Google_data_centers
[2] https://static.googleusercontent.com/media/research.google.c...
bruckie|6 months ago
You can solve the ordered list of words problem in ways that are more efficient than grepping over the entire internet (e.g. bigrams, storing position information in the index).