top | item 11161863

(no title)

melted | 10 years ago

Google is one of DDGs backends, and the highest quality one at that. DDG does not have its own crawlers or index. Nor could it: those things cost hundreds of millions of dollars per year to run and maintain.

[Edit: apparently this is not true. DDG does not use Google, though it does use Bing and some other broad and narrow coverage search engines]

discuss

order

AdamSC1|10 years ago

DuckDuckGo in no way uses Google. We do use other sources, but Google isn't one of them.

melted|10 years ago

Looks like my information is either out of date or was incorrect all along. Bing seems to be the primary source. They seem to also have a crawler now, though it's not clear how much it really covers. Very cool. Slow and steady wins the race.

graeme|10 years ago

I posted this elsewhere, but I didn't get an answer. Since you work you work for DDG, you might know.

How long would duckduckgo have to grow at its current rate to become an actual blip on Google's radar? On the one hand, you're small. On the other hand, keep up a fast growth rate long enough and you get bigger faster than people's intuitions' expect.

https://duckduckgo.com/traffic.html

mahranch|10 years ago

> Nor could it: those things cost hundreds of millions of dollars per year to run and maintain.

I vehemently disagree. The early web crawlers and indexes did not cost hundreds of millions of dollars to run. Granted, there is significantly less web results than there are today, but the cost you're referring too is the entirety of google's servers. That price tag also includes the cost to host the web traffic of being the number 1 website in the world. You're talking a price tag which is indicative of a final product.

A new search engine would not have those costs initially, and if managed properly from the very beginning, would be able to scale and cover their bills, remaining profitable up until reaching (and hypothetically) replacing google.

melted|10 years ago

Microsoft spends over $1B/yr on maintaining and improving Bing. So yeah, search does cost a fortune if you want to do it reasonably well without relying on others for results. Not only do you have to crawl the web (with different frequencies depending on predicted frequency of content update, etc), you then have to rebuild the index continuously, with reasonable latency. To do all of that, you have to have indexing infrastructure, which in turns requires storage infrastructure, high performanc data processing infrastructure (Hadoop ain't gonna do it, ask Yahoo as to why), which in turn requires high performance networking (or your mapreduce-like workloads will collapse under their own weight), your own datacenter, and your own army of machine learning researchers, systems engineers, hardware engineers, networking engineers, devops, quality engineers (people who improve your scoring function), human eval, etc, etc, etc. If anything, $1B/yr seems to be quite low.