Most of the source code of Google 2022 seem to be on Github separated (PageRank, MapReduce, Freebase, Graphd, etc.). Why isn't there no unified open source 2012 Google-like search engine?
I think that the next search engine is gonna be a service that has an advantage using a technique to identify quality content and ditching the rest, most of the internet is just noise.
I don’t have the resource to run a 2012 search engine. Even with the CommonCrawl dataset, it’s extremely expensive and also not environmental friendly.
[+] [-] fsflover|3 years ago|reply
[+] [-] ffhhj|3 years ago|reply
When future search engines begin receiving feedback from users and ranking accordingly Google will become irrelevant.
[+] [-] tkiolp4|3 years ago|reply
[+] [-] freemint|3 years ago|reply
[+] [-] waterfallr|3 years ago|reply
Wrt to Search,
Scale: Huge cost of infrastructure, data is privatize.
You could run a hobby scale pagerank but for searching web it would be a needle in a haystack.
They have competition like yandex, nacy, baidu but are not for English web and not open source
[+] [-] jfoster|3 years ago|reply
For instance, if Microsoft controlled them, they could just point the search functionality toward Bing, and most users mightn't even notice.
[+] [-] 8note|3 years ago|reply
[+] [-] scottmcdot|3 years ago|reply
[+] [-] is_true|3 years ago|reply
[+] [-] speedgoose|3 years ago|reply