top | item 35784102

(no title)

holbue | 2 years ago

As others mentioned, Tesseract is SOTA in FOSS OCR. It also still is being developed, improving slow but constantly.

The main issue for a use-case like NormCap are the trained models: they are optimized for images of _printed_ text and layouts, which is different from on-screen-text in many aspects. Unfortunately, I don't have the resources to train my own models.

Cuneiform was a long time competitor, but afaik development there is stalled.

discuss

No comments yet.