Re: reading text out of ps/pdf

2001-01-15 Thread Herbert Voss
Tuukka Toivonen wrote: > > On Sun, 14 Jan 2001, Jan Goebel wrote: > > > you can maybe scanner/OCR software like GOCR (open source) > > take a look at: > > http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html > > Sure. You can try it. But don't expect too much. When I last time (maybe a

Re: reading text out of ps/pdf

2001-01-15 Thread Tuukka Toivonen
On Sun, 14 Jan 2001, Jan Goebel wrote: > you can maybe scanner/OCR software like GOCR (open source) > take a look at: > http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html Sure. You can try it. But don't expect too much. When I last time (maybe a half year ago) tested all free OCR progr

Re: reading text out of ps/pdf

2001-01-14 Thread Matej Cepl
Christopher Jones wrote: > So my question is: is there any software out there which attempts to look at > bitmaps and guess what the ascii would be-- something like those programs which > read books through a scanner and try to match font characters to the image. And > I say this question is a rea

Re: reading text out of ps/pdf

2001-01-14 Thread Jan Goebel
Hello, you can maybe scanner/OCR software like GOCR (open source) take a look at: http://altmark.nat.uni-magdeburg.de/~jschulen/ocr/index.html good luck jan PS: @christopher: if you were sucessfull, you may give me a reply? maybe i need it sometimes, too. On Sat, 13 Jan 2001, Matej Cepl w

Re: reading text out of ps/pdf

2001-01-13 Thread Matej Cepl
Christopher Jones wrote: > So my question is: is there any software out there which attempts to look at > bitmaps and guess what the ascii would be-- something like those programs which > read books through a scanner and try to match font characters to the image. And > I say this question is a rea

Re: reading text out of ps/pdf

2001-01-13 Thread Herbert Voss
Christopher Jones wrote: > > I have that tool. But some pdf or ps files consist not of coded text but a > bitmapped image. For instance, pdf and ps files which I download from journal > databases are scanned images of journal pages. ps2ascii and pdftotext will not > extract text from these files,

Re: reading text out of ps/pdf

2001-01-13 Thread Christopher Jones
I have that tool. But some pdf or ps files consist not of coded text but a bitmapped image. For instance, pdf and ps files which I download from journal databases are scanned images of journal pages. ps2ascii and pdftotext will not extract text from these files, since there is no ascii content to

Re: reading text out of ps/pdf

2001-01-13 Thread R. E. de Lima-Lopes
topher Jones <[EMAIL PROTECTED]> > To: LyX <[EMAIL PROTECTED]> > Subject: reading text out of ps/pdf > > This is a reach, I know. But in the hopes that there is something out there for > me, I'll ask the question: is there anything which reads text out of a bitmaped > pdf or ps file? >

reading text out of ps/pdf

2001-01-13 Thread Christopher Jones
This is a reach, I know. But in the hopes that there is something out there for me, I'll ask the question: is there anything which reads text out of a bitmaped pdf or ps file?