Ask HN: Please help me with seo. 600k player statistics are not getting indexed.
6 points| mcorrientes | 13 years ago | reply
My site's holding about 600k player statistics and almost none of them are getting indexed. I'm not exactly sure what the reason is. About a week ago google crawled and indexed 300k pages in about 2-3 days but a few days later they dropped everything again.
We have more content than our competitor and the players statistic page is also liked and shared by visitors a lot.
Our competitor is doing quite well, he got about 4.6m of his player statistics indexed.
Cleaning up the HTML a bit, using meta title and descriptions, properly redirecting old site structures to the right place (301) didn't help, google still doesn't bother about indexing the player statistics.
Although they're still crawling some of them but they choose not show them.
I thought it might be because of duplicate content so I moved the languages from the structure (e.g. /kr/ ) to a sub domain (e.g. kr.riot5.com).
I'm not quite sure if my site's under a penalty or if there's something wrong with my content.
I feel a bit overwhelmed of all the possible reasons that might cause google to stop indexing my page and why they once indexed a lot.
I would be really grateful if someone could help me finding the problem.
The site's at http://www.riot5.com/
[+] [-] Metatron|13 years ago|reply
Re-inclusion requests here (pretty much your only way to ask Google anything easily, but responses aren't guaranteed) https://www.google.com/webmasters/tools/reconsideration?pli=...
Or try the Webmaster forums, where Google folk apparently post every once in a while, but it's mainly a clusterfuck of people hijacking your problem with their own questions, and solutions being unreliably crowdsourced.
Remember: Google moves in mysterious ways. We cannot understand their arcane techniques for we are not worthy.
[+] [-] mcorrientes|13 years ago|reply
[+] [-] itsprofitbaron|13 years ago|reply
Your site is not banned in Google - you can check this by searching for "site:riot5.com"
In terms 301'ing your dead links this is a good thing to do however, don't expect them to show in the same place in the SERPs as they were previously at least not immediately.
I didn't find a canonical tag on your pages, not did I find a robots.txt page or sitemap on your site you definitely should add those (you should also submit the Robots.txt and Sitemap to Google Webmaster Tools)
Similarly, as you have created sub-domains for languages you should really be using Google Webmaster Tools and telling Google they're geo-ips. Ideally, you should also .htaccess the pages so when you visit from another country you are redirected to the geoip address.
There are other things you do as well but once you have done those, and along with naturally building links to your site you should notice that your SERPs are returning to where they once were and are improving.
[+] [-] AznHisoka|13 years ago|reply
Why should they index everything on your site if it's not valuable content? It seems like it's just profile data.. but what exactly should it rank for anyway? Google considers this "thin content", and since the Panda update, they've punished sites that have too much of this content (of course, every site has some thin content, but for you, it's the majority)
[+] [-] reefoctopus|13 years ago|reply
That, the bad text/html ratio, and the non-descriptive urls are what i see as the likely culprits. It looks like you have a good number of backlinks, and that that number is growing quickly.
[+] [-] WillyF|13 years ago|reply
Good external links are the best way to get something indexed. Good internal links are probably the second best way.
[+] [-] davidm|13 years ago|reply
Search Google for "Whenever Jarvan III, the king of Demacia, delivers one of his rallying speeches"
[+] [-] kerryfalk|13 years ago|reply
[+] [-] ch00ey|13 years ago|reply
[+] [-] mcorrientes|13 years ago|reply
[+] [-] shyn3|13 years ago|reply
Slow down your growth next time, limit how many profiles they can view per week. 300k at once they probably thought you were a spammer.