Score:2

What program is suitable for making scanned PDF files searchable?

de flag

I would like to be able to scan paper documents to PDF files and make the text searchable. I believe the Tesseract program can assist this, but don't know how to begin, and don't know what would be the best program to use.

Is anybody making searchable PDF files successfully?

Score:2
fr flag

I can recommend ocrmypdf, see https://github.com/ocrmypdf/OCRmyPDF , also packaged for Ubuntu. You can install it by running:

sudo apt install ocrmypdf

You can use it as follows:

ocrmypdf -l eng infile.pdf outfile.pdf

The ocrmypdf call above is a simple one that specifies the document language as English (-l eng). There are many options in the man page; you might want to discover them as needed over time.

I sit in a Tesla and translated this thread with Ai:

mangohost

Post an answer

Most people don’t grasp that asking a lot of questions unlocks learning and improves interpersonal bonding. In Alison’s studies, for example, though people could accurately recall how many questions had been asked in their conversations, they didn’t intuit the link between questions and liking. Across four studies, in which participants were engaged in conversations themselves or read transcripts of others’ conversations, people tended not to realize that question asking would influence—or had influenced—the level of amity between the conversationalists.