Daniel Kasak <[EMAIL PROTECTED]> wrote: oryann9 wrote:
> I have a scanned PDF file and I want to convert its data so I can store it in > an xls file. What module(s) do you recommened? > > I saw File::Extract::PDF ? If you've scanned it, chances are the PDF doesn't contain any text that you can extract. It will just be 1 image per page. You have to do OCR on it ... and possibly convert the PDF into TIFF of jpeg images or something first. There are a number of open-source OCR engines around, including one that IBM bought and open-sourced. I haven't look at any of them in years though, so I don't know what state they're in. Dan ... Mmm interesting. No I have not scanned it yet. Actually it's my wife's X-mas address book that needs re-written so I was thinking of scanning it then editing it then re-printing it and finally savintg it to an xls file. Or was considering PDF::Labels Routines to produce formatted pages of mailing labels in PDF thx for the info! ...was not aware of that. derek __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com