Daniel Kasak <[EMAIL PROTECTED]> wrote:    oryann9 wrote:

> I have a scanned PDF file and I want to convert its data so I can store it in 
> an xls file. What module(s) do you recommened?
> 
> I saw File::Extract::PDF ?

If you've scanned it, chances are the PDF doesn't contain any text that 
you can extract. It will just be 1 image per page. You have to do OCR on 
it ... and possibly convert the PDF into TIFF of jpeg images or 
something first. There are a number of open-source OCR engines around, 
including one that IBM bought and open-sourced. I haven't look at any of 
them in years though, so I don't know what state they're in.

Dan

...
   
  Mmm interesting. No I have not scanned it yet.  Actually it's my wife's X-mas 
address book that needs re-written so I was thinking of scanning it then 
editing it then re-printing it and finally savintg it to an xls file.

  Or
   
  was considering 
   
  PDF::Labels 
Routines to produce formatted pages of mailing labels in PDF 

  thx for the info! ...was not aware of that.
  derek
  
 

 __________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 

Reply via email to