On November 27, 2023, John Abreau wrote: >The command-line tool "pdftotext" will extract text from a PDF file, and >"pdfimages" will extract images from a PDF file. Both tools are in the rpm >package "poppler-utils".
And if the text happens to be in images, the package ocrmypdf can convert the images to text. Do this before running pdftotext. Dan _______________________________________________ Discuss mailing list Discuss@lists.blu.org http://lists.blu.org/mailman/listinfo/discuss