I’d be interested in discussing this more. I’ll shoot you an email.

After a quick search, I found pdfgrep[0] which sounds like one simple way to solve this problem. But I think it only runs on Linux.

[0] https://pdfgrep.org/

MWil|3 years ago

Not sure if you were one of the people who shot me an email or not, but I don't think pdfgrep captures the essence of the project. It's more about annotation, note-taking, and human review of PDFs than regex search. That's not to say OCR of the PDFs wouldn't be involved, though; I'm a big fan of OCRmyPDF.

But this is also not intended to have any command-line interface. I can work that way myself, but that's definitely not the audience I'm building for. Quite the opposite.