Ask HN: Is there any HTML table scraper generator in python or else?
2 points| jeffjia | 12 years ago
In one of my projects, I happen to need to get some scrapers running for tens of websites to collect rows, columns of tables (<table>, <ul>, <div>). Those tables are well formatted. I have written several scrapers in python, which basically use CSS selector and then do some simple transformation with regular expression. I just wonder whether there is any scraper generator which may take a url and sample target output as input, and produce a scraper automatically?
Any suggestion is welcomed. Thanks in advance.
tonyfelice|12 years ago
The webintro example here (https://github.com/ariya/phantomjs/wiki/Examples) scrapes a specific element.
jeffjia|12 years ago
brandonlipman|12 years ago
Johnie|12 years ago
jeffjia|12 years ago
murtza|12 years ago
http://scrapy.org/
jeffjia|12 years ago
Larrikin|12 years ago
taddeimania|12 years ago
jeffjia|12 years ago