top | item 46331752

(no title)

GZGavinZhao | 2 months ago

Does it handle math expressions (those rendered from LaTeX) well? I've been looking for a good OCR model to transcribe my math textbooks into markdown (obviously ignoring the images and figures) with LaTeX as math expressions, and none of the current OCR models work reliably enough.

EDIT: you can try it yourself for free at https://console.mistral.ai/build/document-ai/ocr-playground once you create a developer account! Fingers crossed to see how well it works for my use case.

discuss

order

loaf_api|2 months ago

I've just finished processing thousands of documents using the Gemini Pro 3 vision model and it outperformed every OCR and image model I've tested by a long shot, perfect markdown with latex for the math every time.

lysecret|2 months ago

3 flash is also insanely good even slightly outperforms 3 pro for me.

pacman1337|2 months ago

what prompt are you using?

RagnarD|2 months ago

Please post an update on how well it works for you.

nerbert|2 months ago

Just need to open the link to answer that question.