top | item 24466087

(no title)

vivekseth | 5 years ago

There’s a little more nuance than that. Even if text is drawn using plaintext data there’s no guarantee that the characters/words appear in the correct order or have the proper white space between them.

discuss

order

person_of_color|5 years ago

The best method is probably to render the PDF and use OCR.

liability|5 years ago

Unfortunately that's obnoxiously inefficient if you're trying to run it through text-to-speech in real time.