top | item 8285043

(no title)

wrath | 11 years ago

Amazon has made it much harder to scrape their site in recent months. They now give captchas to bots that try and access their site to often by the same IP address. Unless you have a very large pool of low cost proxies, crawling millions of prices on a daily basis is challenging. I think this is what he's referring to when he says """and has lately made it very difficult to get the data in other ways."""

discuss

order

toomuchtodo|11 years ago

The best way to do this is with your users using a Chrome extension to feed the data back to you. User lands on a page? Send the data back to your scrapper processing API.

jarkarj|11 years ago

I've been scraping Amazon for the past few years. Yeah the captcha that they introduced around last February is a pain, but there are ways around it. I have and still do make my living from scraping (not just Amazon) - lemme know if you are interested and maybe I can help someone out.