Re: OCR questions

Nelson Castillo Sat, 21 Jul 2007 17:54:53 -0700

On 7/21/07, Wayne Topa <[EMAIL PROTECTED]> wrote:

Nelson Castillo([EMAIL PROTECTED]) is reported to have said:
> On 7/21/07, Osamu Aoki <[EMAIL PROTECTED]> wrote:
> >On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote:
> >> On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote:
> >> Why not use the Debian package? It is called "tesseract-ocr".
> >
> >Yes.  But it is the old 1.02 version and has an FTBFS bug.
>
> Yes, it's old. I installed from sources but I don't get the charsets.
>
> tesseract test.tiff out
> Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset
>
> How do I get them?


1.  apt-cache search tesseract-ocr
tesseract-ocr - Command line OCR tool
tesseract-ocr-data - Command line OCR tool data

2.  aptitude install tesseract-ocr tesseract-ocr-data

3.  less /usr/share/doc/tesseract-ocr/README

This is in testing.  YMMV if you're running etch.


Hi.

I run sid, and I wanted the latest version. The Debian package installs
fine, but it's old.
I just noticed that the language files are not installed by default.

I just found this:

 To be completely language independent, there is *no* language
 data with the source, so you have to download a separate language
 file to get it to work at

http://groups.google.com/group/tesseract-ocr/browse_thread/thread/2b11730eae611b40/2a780e0d6227cb02#2a780e0d6227cb02
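
Once a language file is unpacked, a quick check like the one below could
confirm that the data landed where tesseract looks for it. This is only a
minimal sketch: check_tessdata is a hypothetical helper name, and the
directory in the usage comment is taken from the error message quoted
above, so adjust it if your install prefix differs.

```shell
# Hypothetical helper (not part of tesseract): report whether a tessdata
# directory contains the unicharset file for a given language code.
check_tessdata() {
    dir="$1"    # e.g. /usr/local/share/tessdata (path from the error above)
    lang="$2"   # e.g. eng
    if [ -f "$dir/$lang.unicharset" ]; then
        echo "found: $dir/$lang.unicharset"
        return 0
    else
        echo "missing: $dir/$lang.unicharset"
        return 1
    fi
}

# Usage (assumed path, copied from the error message):
#   check_tessdata /usr/local/share/tessdata eng
```

If it reports the file as missing, the language data has not been
installed under that prefix yet.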

Regards.

--
http://arhuaco.org
http://emQbit.com

