The command-line tool "pdftotext" will extract text from a PDF file, and "pdfimages" will extract images from a PDF file. Both tools are in the rpm package "poppler-utils".
If you're using debian or ubuntu, it's possible that the .deb package is named differently, but I assume it's available on those distributions. On Mon, Nov 27, 2023 at 2:42 PM Rich Pieri <richard.pi...@gmail.com> wrote: > On Mon, 27 Nov 2023 09:55:04 -0800 > Kent Borg <kentb...@borg.org> wrote: > > > > and that any attempt to read the raw text of the email had been > > > blocked in Thunderbird. > > They manage to disable "View"->"Message Source Ctrl+U"? That is > > impressive. > > If they buried the whole thing in the PDF file then there is no raw > message text. And never mind that this violates all the mail handling > standards and never mind the ADA. > > Anywho, it's entirely possible that there is no text at all, and the > PDF is bitmap image(s). A simple PDF viewer like Sumatra, which doesn't > have a JavaScript interpreter, should make this apparent, or that it's > all embedded JavaScript. > > -- > \m/ (--) \m/ > _______________________________________________ > Discuss mailing list > Discuss@lists.blu.org > http://lists.blu.org/mailman/listinfo/discuss > -- John Abreau / Executive Director, Boston Linux & Unix Email: abre...@gmail.com / WWW http://www.abreau.net / PGP-Key-ID 0x920063C6 PGP-Key-Fingerprint A5AD 6BE1 FEFE 8E4F 5C23 C2D0 E885 E17C 9200 63C6 _______________________________________________ Discuss mailing list Discuss@lists.blu.org http://lists.blu.org/mailman/listinfo/discuss