[tesseract-ocr] Re: v4.1.1 - Segmentation fault on train data generation; all .lstmf files are exactly 1GB

2021-09-22 Thread Sim Tov
Maybe it is just a bug I need to open an issue? On Monday, September 20, 2021 at 2:52:18 PM UTC+3 Sim Tov wrote: > Hello, > > I use v4.1.1 on Linux (Debian 11) and try to generate train and evaluate > data. The commands I used were: > > train: > > usr/share/tesseract-ocr/tesstrain.sh --fonts_dir

Re: [tesseract-ocr] Spaces recognition

2021-09-22 Thread Zdenko Podobny
Please also send the input image and code. Zdenko st 22. 9. 2021 o 7:27 Julio Hidalgo napísal(a): > Hi guys > > I am using tesseract-OCR to read text from image, the issue is that it > does not recognize the spaces between words. > > I am using C++, can anybody provide any advise on how to ove

Re: [tesseract-ocr] Re: v4.1.1 - Segmentation fault on train data generation; all .lstmf files are exactly 1GB

2021-09-22 Thread Zdenko Podobny
And what about testing the latest code? "tesstrain.sh" training is not supported anymore, and for creating issues you must use the latest code anyway. Zdenko st 22. 9. 2021 o 9:20 Sim Tov napísal(a): > Maybe it is just a bug I need to open an issue? > > On Monday, September 20, 2021 at 2:52:18

[tesseract-ocr] Re: lstmtraining query

2021-09-22 Thread Samruddhi Dhake
Hi, Can anyone help me to resolve above issues? Regards, Samruddhi On Thursday, September 16, 2021 at 2:58:50 PM UTC+5:30 Samruddhi Dhake wrote: > > Hi, > One more question to add here is, after running 2nd command mentioned > above, I am getting assert in file lstmtrainer.h, but I didn't find