top | item 42955441

(no title)

oedemis | 1 year ago

there is also https://ds4sd.github.io/docling/ from ibm research which is mit license and track bounding boxes as rich json format

discuss

order

pbronez|1 year ago

Docling has worked well for me. It handles scenarios that crashed ChatGPT Pro. Only problem is it's super annoying to install. When I have a minute I might package it for homebrew.

disqard|1 year ago

Did you compare it to tesseract?

If it's superior (esp. for scans with text flowing around image boxes), and if you do end up packaging it up for brew, know that there's at least one developer who will benefit from your work (for a side-project, but that goes without saying).

Thanks in advance!