top | item 2948873

You can't google 9999999..99999999999999999999999

364 points| reg29 | 14 years ago |google.com

80 comments

order
[+] waterhouse|14 years ago|reply
On a faintly related note, I once was calculating partitions, using a relatively inefficient method (memoization: if f(n,k) is the number of distinct ways to express n as the unordered sum of integers no greater than k, then f(n,k) = f(n-k,k) + f(n,k-1)). My computer started to feel the strain in the thousands (this algorithm is O(n^2) in space and time). I then googled for the partition of, say, 1034, which is:

  91363785902248291467082481888195
I figured that chances are that no website will have that integer on there by accident. Lo and behold, I found, among other results, a text file containing the partitions up to 10,000, presumably done with a more efficient algorithm (likely Euler's ridiculous pentagonal-number recurrence formula, which, memoized, is roughly O(n^1.5) in time and O(n) in space): http://oeis.org/A000041/b000041.txt
[+] Sniffnoy|14 years ago|reply
It should be pointed out here for everyone else that in general, if you want terms of a well-known integer sequence, OEIS (and the links section for longer tables like that) is a good place to check.
[+] jwr|14 years ago|reply
This is why I love Hacker News. The pure hack value of some comments is mind-blowing.
[+] wbhart|14 years ago|reply
As coincidence would have it, a friend just announced to me that he has written code to compute the number of partitions of 10^12 in 109 seconds.
[+] mattdeboard|14 years ago|reply
Ridiculous in what way? Ridiculously good compared to alternatives, or ridiculously complex or? Asking out of pure ignorance here.
[+] buro9|14 years ago|reply
The double dot indicates that this is a ranged query.

11..22 would be to search for all integers between 11 and 22.

The effect is that the search is too broad. The problem is more likely to be that the range search is a mapreduce that performs a search for each item in the range.

You can imagine why that's a bad idea, and some aggressive timeouts are probably what stop it from going too far.

Plus the numbers in the range of the OP search are likely to fall in the ranges of sensitive numbers, credit cards being the most sensitive... which are likely to be explicitly blocked.

[+] threecreepio|14 years ago|reply
Then why does 100000..100000 get the same error? Not a huge span, and not likely to contain anything all too sensitive.
[+] adgar|14 years ago|reply
> The effect is that the search is too broad. The problem is more likely to be that the range search is a mapreduce that performs a search for each item in the range.

I'm a bit confused to see this so highly upvoted. I'm not sure you understand what MapReduce is for. MapReduce is not a technology designed for executing a search query in under 100ms, it's for massive data processing jobs. MapReduce is what you would use to do to prepare an index, not to query it.

> You can imagine why that's a bad idea, and some aggressive timeouts are probably what stop it from going too far.

As you yourself noted, this is blocked to prevent people from searching for SSN/credit card/etc. dumps.

[+] user24|14 years ago|reply
The reason for this is that you can google ranges of numbers, and people a couple of years ago were using this feature to find credit card numbers that were posted online, eg searching for 4000000000000000..4999999999999999

It was around the time that johnny's google hacking page became popular, iirc. (http://www.hackersforcharity.org/ghdb/)

[+] ck2|14 years ago|reply
You also cannot google some stuff commonly found in phpBB because of so many hacked sites out there.

Also remember how you used to be able to search for anything up to the 1000th item (10 pages of 100 results). Not anymore for a long time now. Google sucks it all in but won't share and play nice with others.

Why not allow such searches unless a bot is detected (too many pages too quickly, etc.)

[+] david927|14 years ago|reply
"With all due respect John, I am the head of IT and I have it on good authority that if you type 'Google' into Google, you can break the Internet." -- The IT Crowd
[+] martinkallstrom|14 years ago|reply
I think this question is cool to think about and try to answer: What is the lowest integer that doesn't have any hits on google? Is there any reasoning that can help estimating the approximate magnitude it should be?
[+] JacobAldridge|14 years ago|reply
You could look at the hits returned on increasing numbers (eg, 1111 v 11111). A quick glance shows they seem to reduce towards zero by the time you reach 12 digits.

From there, it's probably trial and error - start with 12 1's in a row and fiddle with them. Even try deleting 2 random digits to see what happens (from 12 ->10 as anything with 11 numbers returns a UPS Package Tracking Link / Ad for me). Lowest I had is 111232111222 - perhaps someone should write a Wikipedia page on it?

http://en.wikipedia.org/wiki/Interesting_number_paradox

[+] Almaviva|14 years ago|reply
What's the lowest integer that will never have any hits on Google? It must exist by induction, I think.
[+] unknown|14 years ago|reply

[deleted]

[+] jergosh|14 years ago|reply
Guess they don't want you to search for credit card numbers
[+] arrowgunz|14 years ago|reply
Can someone please explain me why that happens? Google says - "Our systems have detected unusual traffic from your computer network. Please try your request again later."
[+] wisty|14 years ago|reply
It seems to slow down, as you increase from:

111..111 11111..111111

and eventually crashes.

I'm guessing there is some kind of grammar parser breaking horribly. There's grammar parsers that have extremely poor performance trying to parse stuff that looks like this. It might do:

search 1 and 111111..11111

search 11 and 11111..11111

...

search 1 and 1 and 11111..111

...

...

...

and google detects that you are DOSing its server.

OK, someone else has pointed out that this is a range search, and google may be protecting CC searches.

[+] tonyedgecombe|14 years ago|reply
Years ago somebody discovered you could Google for credit card numbers that had been left on the web inadvertently, by searching for the right number range. I wonder if that is why it is unavailable.
[+] YokoZar|14 years ago|reply
Do you regularly search for 99999...999999? Sounds like it's unusual to me!
[+] tsycho|14 years ago|reply
Reading the comments here gave me an idea....google your own credit card numbers to check if its already in some scammer's index. While no results might not necessarily mean you are safe, a positive match is a clear red flag.
[+] andrenotgiant|14 years ago|reply
Even better, LOCK DOWN YOUR GOOGLE ACCOUNT WITH DOUBLE AUTHENTICATION, then set Google alerts for your:

credit card, or if paranoid, partial credit card # "123456789"

Email address

Home address

Phone number

screen name (e.g.: "andrenotgiant" -site:ycombinator.com)

Bank acct partial

Password, Password partial "p@ssw0r"

[+] iclelland|14 years ago|reply
Better yet, do this several times per day -- see how long it takes to get your numbers into Google Suggest :)
[+] bgentry|14 years ago|reply
Doing that over HTTP (not HTTPS) is a fairly bad idea.
[+] mahrain|14 years ago|reply
Funny how Duckduckgo lists all sites linking to this story.
[+] n0fair|14 years ago|reply
Our systems have detected unusual traffic from your computer network. Please try your request again later. Why did this happen?
[+] atlbeer|14 years ago|reply
Because what Google is preventing is the discovery of accidental credit cards in it's index.

You searching for credits cards (via that ranged query) is a flag that the search is possibly automated and they display the warning and catchpa as a mitigator.

[+] program|14 years ago|reply
Neither

  1111111..11111111111111111111111
  11111111..1111111111111111111111
  111111..111111 // minimum case
or any other integer combination. 6 digits is the minimum amount that spawn the bizarre error.
[+] chanux|14 years ago|reply
000000..000000 //minimum case?

I know, the point is this happens with 6 digits minimum on each side of ..

[+] yogsototh|14 years ago|reply
100000..100000 Neither
[+] mikkohypponen|14 years ago|reply
However, you can still set Google Alerts for a search like that.
[+] tete|14 years ago|reply
I see why it doesn't work, but why doesn't it work with quotes either?
[+] drungli|14 years ago|reply
I tried also this: 9999999..999999 same results!
[+] skeptical|14 years ago|reply
Probably that string was used as a dork together with some more text to find vulnerabilities on web applications or so. So it was reported as a honey

This does look like a string that could very much be generated by some poorly coded webbapp. An incorrect usage of a floating point number can easily generate such output. If it occurs on a critical part of an application it could very well been used as a dork. Just an hypothesis though.

Back in 2005 google tricks were at their peak. Many hackers experimented with search phrases in order to retrieve interesting/valuable/uncommon/dangerous?/sensitive info from the web. "index of/ .mp3" "apache server at port" being the absolute classic.

More and more people started to jump in the bandwagon, webmasters gradually became more aware of this, and google too. The natural reaction was google honeypot.

http://ghh.sourceforge.net/

But it wasn't too long until google started to remove such features. These days one can hardly search for symbols on google. They the old tricks, most of them will not work, google simply ignores the details and returns a list of results based on the actual words contained in the query. It's becoming a QA machine. That's one of the reasons I switched to duckduckgo.

For a proper reading on the subject check ou the vast website of the, now deceased, great hacker Fravia: http://www.searchlores.org/indexo.htm