Re: [tesseract-ocr] tesseract produces one time bad one time good results

2019-07-18 Thread Shree Devi Kumar
Binarize and invert the images to get black text on white. I tried with latest code from master branch on github, gives correct results. tesseract 2-bw.png stdout --psm 6 --dpi 300 --tessdata-dir ~/tessdata --oem 1 --user-patterns ./timestamp.patterns.txt -c lstm_use_matrix=1 -c tessedit_char_whit

Re: [tesseract-ocr] tesseract produces one time bad one time good results

2019-07-18 Thread Claudiu
Can someone explain, what does lstm_use_matrix option do? On Thu, Jul 18, 2019 at 11:36 AM Shree Devi Kumar wrote: > Binarize and invert the images to get black text on white. I tried with > latest code from master branch on github, gives correct results. > > tesseract 2-bw.png stdout --psm 6 -

[tesseract-ocr] Re: VietOCR 5.0 Java & .NET Releases

2019-07-18 Thread Quan Nguyen
VietOCR v5.5.0 & VietOCR.NET v5.5.0 Releases A Java/.NET WPF GUI frontend for Tesseract OCR engine. The releases include the following improvements: - Upgrade to Tesseract 4.1.0 http://vietocr.sf.net > -- You received this message because you are subscribed to the Google Groups "tesseract-o

Re: [tesseract-ocr] Trained data for E13B font

2019-07-18 Thread ElGato ElMago
Hi, Let's call them phantom characters then. Was psm 7 the solution for the issue 1778? None of the psm option didn't solve my problem though I see different output. I use tesseract 5.0-alpha mostly but 4.1 showed the same results anyway. How did you get bounding box for each character? Alt

Re: [tesseract-ocr] Trained data for E13B font

2019-07-18 Thread Shree Devi Kumar
Please check out the recent commits in master branch https://github.com/tesseract-ocr/tesseract/pull/2554 On Fri, 19 Jul 2019, 10:55 ElGato ElMago, wrote: > Hi, > > Let's call them phantom characters then. > > Was psm 7 the solution for the issue 1778? None of the psm option didn't > solve my

[tesseract-ocr] Training stops before specified iterations

2019-07-18 Thread Pooja Kamra
In training comand, max iterations given are 1. But training stops after 4600 iterations. What can be reason for this. Regards, Pooja -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails

Re: [tesseract-ocr] Trained data for E13B font

2019-07-18 Thread Shree Devi Kumar
Also https://github.com/tesseract-ocr/tesseract/pull/2576 On Fri, 19 Jul 2019, 11:14 Shree Devi Kumar, wrote: > Please check out the recent commits in master branch > > https://github.com/tesseract-ocr/tesseract/pull/2554 > > On Fri, 19 Jul 2019, 10:55 ElGato ElMago, wrote: > >> Hi, >> >> Let's

Re: [tesseract-ocr] Training stops before specified iterations

2019-07-18 Thread Shree Devi Kumar
The target character error rate may have been achieved. On Fri, 19 Jul 2019, 11:14 Pooja Kamra, wrote: > In training comand, max iterations given are 1. But training stops > after 4600 iterations. > What can be reason for this. > > Regards, > Pooja > > -- > You received this message because

Re: [tesseract-ocr] Training stops before specified iterations

2019-07-18 Thread Pooja Kamra
Dear Shree, I have not specified target error rate. What eror rate will be taken as default. On Friday, July 19, 2019 at 11:17:12 AM UTC+5:30, shree wrote: > > The target character error rate may have been achieved. > > On Fri, 19 Jul 2019, 11:14 Pooja Kamra, > > wrote: > >> In training coman