top | item 41869821

(no title)

etewiah | 1 year ago

You've got me thinking. Would this work for real estate data? A lot of sites make it quite hard to grab their raw data. Also, perhaps it could gain some insights from the photos...

discuss

order

TechDebtDevin|1 year ago

Been scraping real estate data off every major real estate site for a while. They practically give away their data, there's zero reason to introduce an added cost for llms.

Sure you could do this, and it would work, but you'd spend about 100000x what I do with a $10 Hetzner VPS and a small amount of proxy bandwidth.

bambax|1 year ago

It's crazy to think we live in a world where video to llm ocr is simpler (and cheaper?) than plain old html parsing. Maybe someone will rebuild the Twitter API like this?!?

simonw|1 year ago

I'm certain it would. That would be a really fun experiment to run!

jerpint|1 year ago

Could also work for social media which can be hard to scrape