Re: How to extract text from PDF?

Mike Bird Wed, 05 Mar 2008 15:45:27 -0800

On Wed March 5 2008 15:20:57 Andrius wrote:
> technical question: is it possible to extract text from PDF? From PDF to
> txt.


If the PDF was built from text, then pdftotext will extract the text.
pdftotext is in the xpdf-utils package.  Be careful: if you don't
explicitly specify an output file pdftotext will create one, possibly
overwriting a file you'd rather not have overwritten.

If the PDF was built from an image - e.g. a scanned document - you'd
need some kind of OCR.

--Mike Bird


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED] 
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]

Re: How to extract text from PDF?

Reply via email to