(no title)
breadislove | 4 months ago
- the OmniAI benchmark is bad
- Instead check OmniDocBench[1] out
- Mistral OCR is far far behind most Open Source OCR models and even further behind then Gemini
- End to End OCR is still extremely tricky
- composed pipelines work better (layout detection -> reading order -> OCR every element)
- complex table parsing is still extremely difficult
hakunin|4 months ago
wahnfrieden|4 months ago
graeme|4 months ago
CaptainOfCoit|4 months ago
cheema33|4 months ago
According to Omni OCR benchmark, Omni OCR is the best OCR. I am sure you all will find no issues with these findings.