oryann9 wrote:
I have a scanned PDF file and I want to convert its data so I can store it in
an xls file. What module(s) do you recommened?
I saw File::Extract::PDF ?
If you've scanned it, chances are the PDF doesn't contain any text that
you can extract. It will just be 1 image per page. You have to do OCR on
it ... and possibly convert the PDF into TIFF of jpeg images or
something first. There are a number of open-source OCR engines around,
including one that IBM bought and open-sourced. I haven't look at any of
them in years though, so I don't know what state they're in.
Dan
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>