Re: How to get a better result with tesseract

2012-03-06 Thread zdenko podobny
First of all: If it is possible - do not use jpeg for OCR. I think you will need to improve image before OCR. Try to have a look at FAQ[1] for some hints. [1] http://code.google.com/p/tesseract-ocr/wiki/FAQ#Output_it_without_result_or_wrong On Wed, Mar 7, 2012 at 7:05 AM, Roast wrote: > Anyone

Re: Error during using tesseract-ocr

2012-03-06 Thread zdenko podobny
On Tue, Mar 6, 2012 at 6:20 PM, Ivan Mushketik wrote: > Here is my output: > $ tesseract -v > tesseract 3.02 > leptonica-1.68 > > This means your leptonica have no support for image files. For good installation of leptonica you would see something like this: tesseract 3.02 leptonica-1.68 (Mar 14

Re: How to get a better result with tesseract

2012-03-06 Thread Roast
Anyone can help me? On Sat, Feb 25, 2012 at 11:21 AM, Roast wrote: > Hi,all. > > I am using compiled Tesseract V3.0.1 to process images like the image > file attached. But get a bad result even is not usable. > > Here is the command line to process those images, and Tesseract is without > train

Re: extract word-list failed

2012-03-06 Thread Sriranga(78yrs)
David, Thanks for the valuable guidance. Copied dawg2wordlist.exe pasted in the folder n:\Newfolder\ wherein extracted files Kan.unicharset, kan.word-dawg, kan.freq-dawg are located. extract of cmd is reproduced below - with encounter.exe windows messages displayed for word-dawg and freq-dawg. M:

Re: Error during using tesseract-ocr

2012-03-06 Thread Ivan Mushketik
Hello. I tried to change extension of the file, but it did not help. 2012/3/6 Esteban Bordón > Hi. > > I think that you have to change the extension of the image file. It > have to be .tiff instead .tif. > > cheers, > Esteban. > > 2012/3/6 Ivan Mushketik : > > Hello. > > > > I tried to run tess

Re: Error during using tesseract-ocr

2012-03-06 Thread Ivan Mushketik
Here is my output: $ tesseract -v tesseract 3.02 leptonica-1.68 2012/3/6 Zdenko Podobný > This is leptonica error message that indicate problem with image support. > If you have version from today, please send output of: > tesseract -v > > Zdenko > > Dňa 06.03.2012 15:42, Ivan Mushketik wrote

Re: extract word-list failed

2012-03-06 Thread David Eger
Where you put wordlist2dawg.exe, try putting the name of the output list instead. On Friday, March 2, 2012 2:39:33 AM UTC-8, sriranga(79yrsold) wrote: > > I had extracted kan.word-dawg from the Kan.traineddata. I am trying to > convert dawg to wordlist using commandline in cmd as follows: > > **

Python wrapper for Tesseract which gives confidence of the recognition

2012-03-06 Thread J
Hi there, I'm rather new to Tesseract and OCR in general and am keen to get to grips with it for a little project I'm working on. I'm working on a Windows 7 machine with python27. I would like to use a python wrapper, and have seen a few around and have experimented with https://github.com/jbo

Re: Error during using tesseract-ocr

2012-03-06 Thread Zdenko Podobný
This is leptonica error message that indicate problem with image support. If you have version from today, please send output of: tesseract -v Zdenko Dňa 06.03.2012 15:42, Ivan Mushketik wrote / napísal(a): > Hello. > > I tried to run tesseract-ocr v 3.02 with the following params: > tesseract pho

Re: Error during using tesseract-ocr

2012-03-06 Thread Esteban Bordón
Hi. I think that you have to change the extension of the image file. It have to be .tiff instead .tif. cheers, Esteban. 2012/3/6 Ivan Mushketik : > Hello. > > I tried to run tesseract-ocr v 3.02 with the following params: > tesseract phototest.tif output > > but received: > Tesseract Open Source

Error during using tesseract-ocr

2012-03-06 Thread Ivan Mushketik
Hello. I tried to run tesseract-ocr v 3.02 with the following params: tesseract phototest.tif output but received: Tesseract Open Source OCR Engine v3.02 with Leptonica Error in findTiffCompression: function not present Error in pixReadStreamTiff: function not present Error in pixReadStream: tiff