Re: Announcement: new version of pyTesseractTrainer available

2010-08-13 Thread Zdenko Podobný
Dňa 14.08.2010 00:17, Jimmy O'Regan wrote / napísal(a): > On 13 August 2010 21:54, zdenko podobny wrote: > >> Hello, >> I would like to announce new version 1.01 of pyTesseractTrainer - successor >> of tesseractTrainer.py Version 1.00 is identical with tesseractTrainer.py. >> Features: >> >> v

Re: Announcement: new version of pyTesseractTrainer available

2010-08-13 Thread Zdenko Podobný
Dňa 13.08.2010 23:19, Robert Komar wrote / napísal(a): > On Fri, 13 Aug 2010, zdenko podobny wrote: > >> Because IFAIK nobody react on Catalin e-mail I offered him to create >> project >> to collect patches and possibly to solve known issues. Because of my low >> time resource project is looking s

Re: No treatment for touching letters?

2010-08-13 Thread రాకేశ్వర రావు
Hi Jimmy, Thank you for your replies. Shouting - so that people find it easy to locate the actual questions. < Well, that's what 'DangAmbigs' are for -- impossible sequences, and their corrections. > >From my understanding of DangAmbigs, I did NOT infer that it could be used for impossible seque

Re: Announcement: new version of pyTesseractTrainer available

2010-08-13 Thread Jimmy O'Regan
On 13 August 2010 21:54, zdenko podobny wrote: > Hello, > I would like to announce new version 1.01 of pyTesseractTrainer - successor > of tesseractTrainer.py Version 1.00 is identical with tesseractTrainer.py. > Features: > > visual editor of box file > layout of symbol from box file reflect symb

Re: Announcement: new version of pyTesseractTrainer available

2010-08-13 Thread Robert Komar
On Fri, 13 Aug 2010, zdenko podobny wrote: Because IFAIK nobody react on Catalin e-mail I offered him to create project to collect patches and possibly to solve known issues. Because of my low time resource project is looking still for owner/contributors. Warmly welcomed are expect for python (m

Announcement: new version of pyTesseractTrainer available

2010-08-13 Thread zdenko podobny
Hello, I would like to announce new version 1.01 of pyTesseractTrainer - successor of tesseractTrainer.py Version 1.00 is identical with tesseractTrainer.py. Features: - visual editor of box file - layout of symbol from box fi

Re: No treatment for touching letters?

2010-08-13 Thread SteveP
I looked at your image. I do not know the answer if your raw image from the scanner looks like this. If you are preprocessing the image before passing it to Tesseract, it looks like you are converting too many pixels from gray or white to black. If the raw image has white or gray pixels between

Re: No treatment for touching letters?

2010-08-13 Thread Jimmy O'Regan
2010/8/13 రాకేశ్వర రావు : > actually I have had this problem of symbols being chopped up. > I do not blame the engine though. Some symbols in my language are twice or > thrice that of the average. > Eg:- మూ [muu] is almost thrice in length as రీ[rii]. Similar problem must be > there in English with

Re: No treatment for touching letters?

2010-08-13 Thread రాకేశ్వర రావు
actually I have had this problem of symbols being chopped up. I do not blame the engine though. Some symbols in my language are twice or thrice that of the average. Eg:- మూ [muu] is almost thrice in length as రీ[rii]. Similar problem must be there in English with m , i etc. Unfortunately for me the