Show HN: OCR pipeline for ML training (tables, diagrams, math, multilingual) (github.com) 170 pts| 11 months ago | 38 comments