top | item 36540450

(no title)

totalconfusion | 2 years ago

This is so real. Google's core functionality has become a victim of its own success and is now borderline unusable.

No exaggeration 80-90% of my searches have reddit affixed to the end.

SEO is the devil and page ranks utility died many years ago.

discuss

order

dekarrin|2 years ago

Yeah, it's gotten so bad that I'm starting to seriously contemplate rolling my own web crawler and indexer.

The logical side of me says that's going to be way too much work to manage on my own. But it's hard to shut down the nagging feeling that pops up every time Google chooses to disregard a "+" or double-quotes around a word. That feeling of "would it really be that hard to toss a bunch of site data into an ELK stack and implement a primitive PageRank-ish score on sites? Hmmmm..."

Maybe just a YouTube indexer (whose search has also gone to complete garbage these last few years). Though, that would have the disadvantage of not being justifiable via eventually helping to make me more productive.

toomuchtodo|2 years ago

Crawl the internet archive Wayback machine slowly. Once you have built your index, you can fill your corpus gaps with public crawls. Patron services can advise to be helpful without causing stress on their infra.