Looking for someone to take over tesseractTrainer.py

2010-08-01 Thread Catalin Francu
Hi everyone, I am the author of a little script called tesseractTrainer.py. I wrote it a few years ago to facilitate training Tesseract for a project I was working on at the time. I no longer work on that project, but I still occasionally receive questions, feature requests and e

Re: California License Plate font issues with OCR

2010-08-01 Thread Andres
2010/8/1 ZIA > Thanks Andre for finding the font. I will see how I can use that. As > you suggested using CorelDRAW, I don't have this software, I will try > to see if I can use some other software like MS Word. > You are welcome. Things will be very hard with MS Word. Better to try to get Corel or

Re: California License Plate font issues with OCR

2010-08-01 Thread ZIA
Thanks Andre for finding the font. I will see how I can use that. As you suggested using CorelDRAW, I don't have this software, I will try to see if I can use some other software like MS Word. I was asking how to extract the license plate from an image. What I am doing: I get the image, re-sized, convert

Re: Improving accuracy on Tesseract 3.0 (also Issue 265)

2010-08-01 Thread Jimmy O'Regan
2010/8/1 Zdenko Podobný: > > On 28.07.2010 17:02, Jimmy O'Regan wrote: >> > I grepped the code and it seems to be looking for something called > LANG.user-words, but that didn't seem to do anything -- I got the same > garbled text when I ran Tesseract 3 the second time. >

Re: Improving accuracy on Tesseract 3.0 (also Issue 265)

2010-08-01 Thread Zdenko Podobný
On 28.07.2010 17:02, Jimmy O'Regan wrote: > I grepped the code and it seems to be looking for something called LANG.user-words, but that didn't seem to do anything -- I got the same garbled text when I ran Tesseract 3 the second time. >> Turns out T3 does

Re: Can't get the user dictionary to work

2010-08-01 Thread Jimmy O'Regan
2010/8/1 Zdenko Podobný: > On 30.07.2010 15:04, patrickq wrote: > > This is what I did: > > 1. Created a text file called eng.user-words, containing: > Chest > Chestnut > Floor > Vice > > 2. Placed the file in the tessdata folder (next to eng.traineddata) > > 3. Ran recognition on an i

Re: Can't get the user dictionary to work

2010-08-01 Thread Zdenko Podobný
On 30.07.2010 15:04, patrickq wrote: > This is what I did: > > 1. Created a text file called eng.user-words, containing: > Chest > Chestnut > Floor > Vice > > 2. Placed the file in the tessdata folder (next to eng.traineddata) > > 3. Ran recognition on an image returning "Chesf" instea
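
For anyone following along, steps 1 and 2 quoted above can be sketched in Python. This only writes the word list, one word per line, to `<lang>.user-words` inside a tessdata directory. The helper name `write_user_words` and the directory you pass in are assumptions for illustration, not part of Tesseract, and whether Tesseract actually picks the file up is exactly the open question in this thread.

```python
from pathlib import Path


def write_user_words(tessdata_dir, words, lang="eng"):
    """Write a one-word-per-line list to <tessdata_dir>/<lang>.user-words.

    Hypothetical helper: it only creates the file described in the
    thread; it does not call Tesseract or change its configuration.
    """
    path = Path(tessdata_dir) / f"{lang}.user-words"
    # One word per line, with a trailing newline after the last word.
    path.write_text("\n".join(words) + "\n", encoding="utf-8")
    return path


if __name__ == "__main__":
    import tempfile

    # A temporary directory stands in for the real tessdata folder here.
    with tempfile.TemporaryDirectory() as tessdata:
        created = write_user_words(tessdata, ["Chest", "Chestnut", "Floor", "Vice"])
        print(created.name)
        print(created.read_text(encoding="utf-8"), end="")
```

To try it against a real install, pass the folder that holds eng.traineddata in place of the temporary directory.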

Re: Open Source OCR system

2010-08-01 Thread Bikash Bag
Hi, I am also working on Oriya OCR. Can you please share your procedure for recognizing words or letters? Regards, Bikash On 1 August 2010 12:35, Sriranga(77yrsold) wrote: > Dear Rakesh, > Really interesting. Please don't forget me; I would like to join you in > developing OCR for Indian languages

Re: Open Source OCR system

2010-08-01 Thread Sriranga(77yrsold)
Dear Rakesh, Really interesting. Please don't forget me; I would like to join you in developing OCR for Indian languages under your leadership. Yes, there is complexity, as well as a fundamental grammar, in Indian languages based on Sanskrit. I can also contribute Kannada tif image with its text con