On 7/21/07, Wayne Topa <[EMAIL PROTECTED]> wrote:
Nelson Castillo([EMAIL PROTECTED]) is reported to have said:
> On 7/21/07, Osamu Aoki <[EMAIL PROTECTED]> wrote:
> >On Sat, Jul 21, 2007 at 10:53:09PM +0200, Florian Kulzer wrote:
> >> On Sat, Jul 21, 2007 at 22:25:43 +0200, Rodolfo Medina wrote:
> >> Why not use the Debian package? It is called "tesseract-ocr".
> >
> >Yes. But it is old 1.02 version and has FTBFS bug.
>
> Yes, it's old. I installed from sources but I don't get the charsets.
>
> tesseract test.tiff out
> Unable to load unicharset file /usr/local/share/tessdata/eng.unicharset
>
> How do I get them?
1. apt-cache search tesseract-ocr
tesseract-ocr - Command line OCR tool
tesseract-ocr-data - Command line OCR tool data
2. aptitude install tesseract-ocr tesseract-ocr-data
3. less /usr/share/doc/tesseract-ocr/README
This in in testing. YMMV if your running etch.
Hi.
I run sid. I wanted the latest version. The Debian installation is OK.
But it's old.
Now I just noticed that the language files are not installed by default.
I just found this:
To be completely language independent, there is *no* language
data with the source, so you have to download a separate language
file to get it to work at
http://groups.google.com/group/tesseract-ocr/browse_thread/thread/2b11730eae611b40/2a780e0d6227cb02#2a780e0d6227cb02
Regards.
--
http://arhuaco.org
http://emQbit.com
--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]