[tesseract-ocr] Train Tesseract for Number plate

2019-04-07 Thread Gaurav Sharma
Hi Authors, I want to train tesseract ocr for my number plate and get output as num.traineddata. so that i can use it to recognize number plate. i have extracted number plate images from it i want to train my tesseract ocr. Please look into that any help would be appreciated. Tesseract versio

Re: [tesseract-ocr] Training Tesseract 4 from Scratch

2019-04-07 Thread shree
The script given is a simple example which will work for English and other Latin script languages. Check for errors and review output files at every stage. If you are training for Indic scripts, you need to use Norm_Code 2 and pass_through_recoder. RTL languages need further modifications. #=

Re: [tesseract-ocr] How to train tesseract with new script?

2019-04-07 Thread Shree Devi Kumar
Tesseract 4 LSTM training is done using tesseract, not tensowflow. It is easiest to train using synthetic training data generated with training text and fonts. For ancient scripts it may need to be finetuned further using real life images. I have tried training for Brahmi, Akkadian Cueniform and

Re: [tesseract-ocr] How to train tesseract with new script?

2019-04-07 Thread Moni
Thanks for your valuable response Since the scripts doesn't have the trained data, trying to generate the trained data. For creating trained data, whether have to use tensorflow or tesseract for training??? Thanks for taking time off your busy schedule... On Fri, Apr 5, 2019 at 4:01 PM Soumik Ra

[tesseract-ocr] Questions about recognize Chinese characters

2019-04-07 Thread 易鑫
Hello,everyone: Good day!I have trained a chi_sim model to recognize the Chinese characters.You can find the sample image in the attach file. I find that the two Chinese characters are a little connected and the image is very very clear. But tesseract regarded as one Chinese character ,s

Re: [tesseract-ocr] Tesseract on VS

2019-04-07 Thread 易鑫
I am using Tesseract on Visual Studio 2017 Shobhit Kapil 于2019年4月5日周五 下午10:29写道: > Hi All, > > Is there anyone who is using Tesseract on windows using Visual Studio. > > If yes i will be having few questions to ask. > > > Thanks, > Shobhit > > -- > You received this message because you are subs

Re: [tesseract-ocr] How to train tesseract with ancient Greek character

2019-04-07 Thread 易鑫
thanks a lot.I will try. Shree Devi Kumar 于2019年4月4日周四 下午10:05写道: > You don't need to add *"GFS Artemisia" as it may not have the Chinese > characters.* > > Just add Greek character "Φ" to your training text. > I think all fonts that you are using support it. > Verify in generated tif files th

Re: [tesseract-ocr] Tesseract different output on windows then linux

2019-04-07 Thread Chirs Masselli
SOLVED I solved it by downloading the 32 bit setup, it also fixed the bad recognition on windows vs linux without swithcing the train data 32bit linked I used from the wiki https://digi.bib.uni-mannheim.de/tesseract/tesseract-ocr-w32-setup-v4.1.0.20190314.exe On Sunday, April 7, 2019 at 5:39:11

Re: [tesseract-ocr] Tesseract different output on windows then linux

2019-04-07 Thread stjoweil via tesseract-ocr
The assertion which you get was removed in commit dc8745e6fd4c6c070076c44565924faa0d0643a7 two years ago, so you are using an outdated version of Tesseract which is no longer supported. Use `tesseract --version` to see the version of your installed Tesseract. -- You received this message becau

Re: [tesseract-ocr] Training Tesseract 4 from Scratch

2019-04-07 Thread Trong
Thank you, Shree. I placed my trained data file to my dir. It works. Thank you very much! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubs

Re: [tesseract-ocr] Training Tesseract 4 from Scratch

2019-04-07 Thread Shree Devi Kumar
*mgr_.Init(traineddata_path.c_str()):Error:Assert failed:in file ../../src/lstm/lstmtrainer.h, line 110* Your traineddata file path is incorrect or file does not exist On Sun, Apr 7, 2019 at 2:22 PM Trong wrote: > Hi, > I tried to train and got error > > *mgr_.Init(traineddata_path.c_str()):Er

Re: [tesseract-ocr] Training Tesseract 4 from Scratch

2019-04-07 Thread Trong
Hi, I tried to train and got error *mgr_.Init(traineddata_path.c_str()):Error:Assert failed:in file ../../src/lstm/lstmtrainer.h, line 110Illegal instruction (core dumped)* Are there any problems in my enviroiment ? OS: Ubuntu 18.04 64bit. Others are in below element. Thanks, titi@Ubun18:~/t

Re: [tesseract-ocr] Tesseract different output on windows then linux

2019-04-07 Thread Shree Devi Kumar
If problem is with the new windows version, please file issue at https://github.com/UB-Mannheim/tesseract/issues as the maintainers might not be checking this forum often. On Sun, Apr 7, 2019 at 11:33 AM Chirs Masselli wrote: > Yes that's the exact one I used. > > On Sunday, April 7, 2019 at 1:5