
Build a Trial Court Records Scraper Using Ruby

4 points | Heavywater | 3 years ago | joelc.io

2 comments


Heavywater|3 years ago

How it works. We will use the Ruby programming language and a few open source software tools (i.e., Nokogiri, Watir, Selenium, and ChromeDriver) to deploy a hidden ("headless") browser to the OECI case index.
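For anyone who wants the shape of that pipeline before clicking through, here is a rough sketch: Watir drives headless Chrome, the rendered HTML is pulled back, and Nokogiri parses it. It is not the article's actual code. `CASE_INDEX_URL`, the method names, and the `table tr` selector are placeholders I made up, and the `options:` form for headless mode assumes a reasonably recent Watir release.

```ruby
# Sketch only: the URL and CSS selectors below are hypothetical placeholders,
# not the real OECI address or markup.
CASE_INDEX_URL = "https://example.invalid/case-index"

# Load a page in headless Chrome and return the rendered HTML as a string.
def fetch_rendered_html(url)
  # Required here so the file still loads when the gem is absent.
  require "watir" # drives Chrome through Selenium and ChromeDriver
  # Assumption: a Watir version that accepts Chrome arguments via `options:`.
  browser = Watir::Browser.new(:chrome, options: { args: ["--headless"] })
  begin
    browser.goto(url)
    browser.html
  ensure
    browser.close # always shut the browser down, even after an error
  end
end

# Parse an HTML string with Nokogiri and return each table row as an
# array of stripped cell strings. Rows without <td> cells are dropped.
def extract_rows(html)
  require "nokogiri"
  Nokogiri::HTML(html).css("table tr").map do |tr|
    tr.css("td").map { |td| td.text.strip }
  end.reject(&:empty?)
end

# Usage (needs Chrome, ChromeDriver, and both gems installed):
#   rows = extract_rows(fetch_rendered_html(CASE_INDEX_URL))
#   rows.each { |row| puts row.join(" | ") }
```

Keeping the fetch and the parse in separate methods means the parsing half can be exercised against a saved HTML file without launching a browser.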

mdaniel|3 years ago

I will never in my life understand why people go through all the trouble of booting up a headless browser, only then to slurp the HTML back across the WebDriver interface so they can _re-parse_ it using some rando library. Not only is that inefficient, it almost guarantees questions on r/webscraping or SO about "but I see some element in the browser, why is $random_library not parsing it the same as the browser?!11"
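To make the alternative concrete, here is a minimal sketch of asking the browser for the elements directly through Watir instead of shipping the HTML to a second parser. The method name and the `table tr` selector are invented for illustration; `browser` is assumed to be a `Watir::Browser` (or anything that answers `elements(css:)` and `text` the same way).

```ruby
# Sketch only: read table cells from the live DOM through the driver.
# The default selector is a hypothetical placeholder, not real OECI markup.
def rows_from_live_dom(browser, css: "table tr")
  browser.elements(css: css).map do |row|
    # Each cell's text comes from the DOM the browser actually rendered,
    # so there is no second parser to disagree with it.
    row.elements(css: "td").map { |cell| cell.text.strip }
  end.reject(&:empty?) # drop header rows and other rows without <td> cells
end

# Usage (needs Chrome, ChromeDriver, and the watir gem installed):
#   require "watir"
#   browser = Watir::Browser.new(:chrome, options: { args: ["--headless"] })
#   browser.goto("https://example.invalid/case-index") # hypothetical URL
#   rows_from_live_dom(browser).each { |row| puts row.join(" | ") }
#   browser.close
```

The tradeoff is one WebDriver round trip per element, which can be slower on very large tables, but what you read is exactly what the browser sees.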