On November 27, 2023, John Abreau wrote:
>The command-line tool "pdftotext" will extract text from a PDF file, and
>"pdfimages" will extract images from a PDF file. Both tools are in the rpm
>package "poppler-utils".

And if the text happens to be in images, the package ocrmypdf can
convert the images to text. Do this before running pdftotext.

Dan
_______________________________________________
Discuss mailing list
Discuss@lists.blu.org
http://lists.blu.org/mailman/listinfo/discuss

Reply via email to