Re: OCR questions

2007-07-24 Thread Jörg-Volker Peetz
Rodolfo Medina wrote: > Rodolfo Medina wrote: > >> I tried gocr and the result was quite miserable. Then I tried with MS >> Windows >> and it was almost perfect. Somewhere in the web I read that OCR software >> under >> Linux is very poor at the moment and that it's better to use MS Windows for

Re: OCR questions

2007-07-22 Thread Rodolfo Medina
Rodolfo Medina wrote: > I tried gocr and the result was quite miserable. Then I tried with MS Windows > and it was almost perfect. Somewhere in the web I read that OCR software > under > Linux is very poor at the moment and that it's better to use MS Windows for > that: unfortunately my test see

Re: OCR questions

2007-07-21 Thread Osamu Aoki
On Sat, Jul 21, 2007 at 07:54:40PM -0500, Nelson Castillo wrote: > On 7/21/07, Wayne Topa <[EMAIL PROTECTED]> wrote: >> Nelson Castillo([EMAIL PROTECTED]) is reported to have said: >> > On 7/21/07, Osamu Aoki <[EMAIL PROTECTED]> wrote: >> > >On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer

Re: OCR questions

2007-07-21 Thread Nelson Castillo
If you install as stated above with aptitude, tesseract-ocr-data is automatically installed unless you change default behavior of aptitude. FTBFS is just package issue. This package should work. Otherwise, please file bug report. Osamu, thanks a lot. The package works well. Sorry -- if I was

Re: OCR questions

2007-07-21 Thread Nelson Castillo
On 7/21/07, Wayne Topa <[EMAIL PROTECTED]> wrote: Nelson Castillo([EMAIL PROTECTED]) is reported to have said: > On 7/21/07, Osamu Aoki <[EMAIL PROTECTED]> wrote: > >On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: > >> On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote:

Re: OCR questions

2007-07-21 Thread Wayne Topa
Nelson Castillo([EMAIL PROTECTED]) is reported to have said: > On 7/21/07, Osamu Aoki <[EMAIL PROTECTED]> wrote: > >On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: > >> On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: > >> Why not use the Debian package? It is called "t

Re: OCR questions

2007-07-21 Thread Nelson Castillo
On 7/21/07, Osamu Aoki <[EMAIL PROTECTED]> wrote: On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: > On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: > Why not use the Debian package? It is called "tesseract-ocr". Yes. But it is old 1.02 version and has FTBFS bug.

Re: OCR questions

2007-07-21 Thread Osamu Aoki
On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote: > On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: > Why not use the Debian package? It is called "tesseract-ocr". Yes. But it is old 1.02 version and has FTBFS bug. If anyone here is interesed to help maintain update w

Re: OCR questions

2007-07-21 Thread Florian Kulzer
On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote: [...] > I installed tesseract with configure, make, make install, then tried to run it > but got the following error message: > > Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset > > . In the README file ther

Re: OCR questions

2007-07-21 Thread Rodolfo Medina
Rodolfo Medina wrote: > I tried gocr and the result was quite miserable. Then I tried with MS Windows > and it was almost perfect. Somewhere in the web I read that OCR software > under > Linux is very poor at the moment and that it's better to use MS Windows for > that: unfortunately my test see

Re: OCR questions (was: How to acquire text so to edit it?)

2007-07-21 Thread Andrew Sackville-West
On Sat, Jul 21, 2007 at 08:10:27PM +0200, Bob Proulx wrote: > Rodolfo Medina wrote: > > Somewhere in the web I read that OCR software under Linux is very > > poor at the moment and that it's better to use MS Windows for that: > > unfortunately my test seems to confirm that. What do you Debian > >

Re: OCR questions (was: How to acquire text so to edit it?)

2007-07-21 Thread Bob Proulx
Rodolfo Medina wrote: > Somewhere in the web I read that OCR software under Linux is very > poor at the moment and that it's better to use MS Windows for that: > unfortunately my test seems to confirm that. What do you Debian > listers think? I think you should check out these articles. http:/

OCR questions (was: How to acquire text so to edit it?)

2007-07-21 Thread Rodolfo Medina
Rodolfo Medina <[EMAIL PROTECTED]> wrote: >> > Excuse the basic question: >> > I wish to scan a printed text so to have it in an editable text file. >> > How can I do that with `sane' and `scanimage'? On Fri, Jun 08, 2007 at 08:57:03AM -0400, Celejar wrote: >> Scanners scan to image formats.