top | item 45677871

(no title)

shikon7 | 4 months ago

Seems we're now at a point of time when OCR is doing so well, that printing text out and letting computers literally read it is suggested to be superior to processing the endoded text directly.

discuss

order

Legend2440|4 months ago

Neural networks have essentially solved perception. It doesn't matter what format your data comes in, as long as you have enough of it to learn the patterns.

Sharlin|4 months ago

The information density of a bitmap representation of text is just silly low compared to normal textual encodings, even compressed.

programmarchy|4 months ago

PDF is arguably a confusing format for LLMs to read.