tlofreso | 1 year ago | on: Show HN: SeekStorm – open-source sub-millisecond search in Rust
tlofreso's comments
tlofreso | 1 year ago | on: Launch HN: Midship (YC S24) – Turn PDFs, docs, and images into usable data
I suspect this problem gets solved in the next iteration or two of commodity models. In the meantime, being smart about how the context gets divvied up works OK.
I do like the UI you appear to have for citing information: drawing polygons around the extracted data and showing where it appears in the PDF. Nice.
tlofreso | 1 year ago | on: Launch HN: Midship (YC S24) – Turn PDFs, docs, and images into usable data
Agree.
The capability is fairly trivial for orgs with decent technical talent. The tech / processes all look similar:
User uploads file --> Azure prebuilt-layout returns Markdown --> prompt + Markdown + schema sent to LLM --> JSON returned. Do whatever you want with it.
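For anyone who wants the flow above spelled out, here is a rough sketch in Python. It is an illustration under assumptions, not a reference implementation: `extract_structured`, `layout_to_markdown`, and `call_llm` are hypothetical names. The two callables stand in for whatever layout service and LLM client you actually use, so no real Azure or model API is called here.

```python
import json
from typing import Callable


def extract_structured(
    file_bytes: bytes,
    schema: dict,
    layout_to_markdown: Callable[[bytes], str],  # hypothetical: wraps your layout/OCR service
    call_llm: Callable[[str], str],              # hypothetical: wraps your LLM client
    instructions: str = "Extract the fields described by the schema. Reply with JSON only.",
) -> dict:
    """Uploaded file -> Markdown -> prompt + Markdown + schema -> parsed JSON."""
    # Step 1: convert the uploaded file to Markdown.
    markdown = layout_to_markdown(file_bytes)

    # Step 2: assemble the prompt from instructions, schema, and document text.
    prompt = "\n\n".join([
        instructions,
        "Schema:\n" + json.dumps(schema, indent=2),
        "Document:\n" + markdown,
    ])

    # Step 3: send the prompt to the model and parse the reply as JSON.
    data = json.loads(call_llm(prompt))

    # Step 4: minimal sanity check against the schema's top-level "required" list.
    # This is an assumption of the sketch, not full JSON Schema validation.
    missing = [key for key in schema.get("required", []) if key not in data]
    if missing:
        raise ValueError(f"LLM response is missing required fields: {missing}")
    return data
```

Passing the two services in as plain callables keeps the pipeline itself trivial to test with stubs; the real work is in choosing the schema and handling replies that fail to parse.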
tlofreso | 1 year ago | on: Migrating Mess with DNS to Use PowerDNS
Long before Bert was writing articles on the source code of mRNA vaccines, he helped build PowerDNS. He talks about that in a three-part series starting here: https://berthub.eu/articles/posts/history-of-powerdns-1999-2...
A fascinating individual...
tlofreso | 2 years ago | on: Edmunds EV Charging Test: How fast does each EV charge?
That said, Ford aligning on 150 kW chargers across their EV portfolio is a miss. I really hope 250 kW is on the roadmap for next-gen Ford EVs when they adopt NACS.
tlofreso | 2 years ago | on: Real-Time Air Quality Index Visual Map
tlofreso | 4 years ago | on: Pay transparency: which states have laws and do they work?
1: https://twitter.com/digitalocean/status/1395818629657149445
2: https://www.pwc.com/us/en/careers/coloradoifsseniormanager.h...
tlofreso | 5 years ago | on: Turn recipe websites into plain text
It seems you're using the same Python scraper I am: https://github.com/hhursev/recipe-scrapers
tlofreso | 6 years ago | on: The Next Leap in Self-Driving: Prediction
Essentially giving them 'The Flash'-like vision. You could then have all kinds of predictive models based on various patterns.
How's SeekStorm's prowess in mid-cap enterprise? How hairy is the ingest pipeline for sources like decade-old SharePoint sites, PDFs with partial text layers, Excel, email .msg files, etc.?