(no title)
nhirschfeld | 1 year ago
It's both. The OCR part is ofc CPU bound, but the entire text extraction involves reading files, or writing and then reading files.
Without async, these simply block.
As for efficiency - if you're working in an async application context you have to "asyncify" these operations or suffer the consequences.
skavi|1 year ago
nhirschfeld|1 year ago
There are alternative options to tesseract ofc.