Re: Disable Special characters?

2010-04-21 Thread Zdenko Podobný
Hello, maybe this problems if tesseract is not installed to standard place and there is not environment setting (export TESSDATA_PREFIX="directory in which your tessdata resides/") as mentioned in http://code.google.com/p/tesseract-ocr/wiki/ReleaseNotes. I have (on linux) tesseract 2.04 installed

Re: extracting line information

2010-04-21 Thread Neil Benn
Hello, I'm copying in the group to keep this on the group chat - can you do the same please. AFAIK, there is no other library based on tesseract which provides the information you are looking for on windows - sorry. Cheers, Neil On 21 April 2010 16:53, vikas landge wrote: > Hello

Tesseract 3.0 without page layout analysis?

2010-04-21 Thread Jan
Hallo, is it possible to use tesseract 3.0 without page layout analysis, or in one column mode? Especially using the tesseract.exe? Thanks!! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-...@goog

Re: Generating / Training box files.

2010-04-21 Thread Sriranga(77yrsold)
Marin Pierre, Guidance how to use OCRB.Disambiguation.txt effectively? sample is requested. -sriranga(77yrsold) On Wed, Apr 21, 2010 at 3:32 PM, Sriranga(77yrsold) wrote: > Dear Pirrre, > I tested using OCRB.tif and eurotext.tif and its output are attached > herewith. I used commandline for

Re: Generating / Training box files.

2010-04-21 Thread Sriranga(77yrsold)
Dear Pirrre, I tested using OCRB.tif and eurotext.tif and its output are attached herewith. I used commandline for both tif using tesseract 3.0 version. It is observed that for output texts using *cst*(generated by you) and *eng * datafiles for *OCRB.tif *are identical and found to be in order wh