Tell HN: The front page of Hacker News has been deindexed from Google
I've checked the usual technical reasons (html head canonical/robots meta tag, http headers, robots.txt issues) but I don't see anything untoward.
I'll keep looking into it, but I'm posting this here in case the admins/mods have made any changes recently that could have had an effect. There's a possibility that the URL has been removed by Google for some particular reason, though I can't think of many pages that deserve it less than HN.
I'll update this thread if I see anything, but hopefully someone else will post an answer before I figure it out....
[+] [-] Matt_Cutts|12 years ago|reply
Here's a link where I answered the same question about three weeks ago: https://news.ycombinator.com/item?id=5837004 , so this isn't a new issue. In fact, PG has been blocking various bots since 2011 or so; https://news.ycombinator.com/item?id=3277661 is one of the original discussions about this.
And to show this isn't a Google-specific issue, note that Bing's #1 result for the search [hacker news] is a completely different site, thehackernews.com: http://www.bing.com/search?q=hacker+news
In general, I think PG's priority is to have a useful, interesting site for hackers. That takes precedence and is the reason why I believe PG blocks most bots: so that crawling doesn't overload the site.
[+] [-] Roedou|12 years ago|reply
Looks like I'm going to have to stop relying on searching 'hn' when using a different computer, and start typing in the full URL. First world problems are such a burden.
[+] [-] chintan|12 years ago|reply
[+] [-] HNLogInShit|12 years ago|reply
[deleted]
[+] [-] jlgreco|12 years ago|reply
HNSearch works great for HN specific searches anyway.
[+] [-] JoeCortopassi|12 years ago|reply
[+] [-] AsymetricCom|12 years ago|reply
[+] [-] gee_totes|12 years ago|reply
[+] [-] eliben|12 years ago|reply
[+] [-] Roedou|12 years ago|reply
https://news.ycombinator.com/item?id=3277661
Could be a similar issue? I'll take a look.
[+] [-] Roedou|12 years ago|reply
In which case, he should add: <meta name="googlebot" content="noindex"> to the html head of every page.
(I have to say, that's a smart way of avoiding any Eternal Septembering, but it'd be a shame. I often use Google to find old HN threads that I vaguely remember from months or years ago.)
[+] [-] glitch273|12 years ago|reply
[+] [-] jffry|12 years ago|reply
[+] [-] mattparlane|12 years ago|reply
https://www.google.co.nz/search?q=site:news.ycombinator.org
[+] [-] meritt|12 years ago|reply
Unlike Digg, HN has a substantial amount of content in the comments pages though, which are heavily indexed.
Edit - All the comment pages are still indexed just fine. It's /only/ the front-page. Which, imo, doesn't really matter anyway.
[+] [-] Roedou|12 years ago|reply
[+] [-] pstuart|12 years ago|reply
[+] [-] aidscholar|12 years ago|reply
[+] [-] eli|12 years ago|reply
http://ycombinator.com/newsguidelines.html
[+] [-] malandrew|12 years ago|reply
[+] [-] chacham15|12 years ago|reply
[+] [-] unknown|12 years ago|reply
[deleted]
[+] [-] gscott|12 years ago|reply
[+] [-] godgod|12 years ago|reply
[+] [-] quantumpotato_|12 years ago|reply
[+] [-] mindstab|12 years ago|reply