[tesseract-ocr] text2image not found after running tesstrain.sh command

2021-08-23 Thread Samruddhi Dhake
Hello, I am creating my own trainneddata using tesseract v4.1.1 on Windows 10. I am referring documentation https://tesseract-ocr.github.io/tessdoc/tess4/TrainingTesseract-4.00.html I have successfully created .box file and .lstmf file using lstmbox and lstm.train respectively. So next step, I

Re: [tesseract-ocr] text2image not found after running tesstrain.sh command

2021-08-23 Thread Zdenko Podobny
How did you install tesseract? Did you also install training tools? Zdenko po 23. 8. 2021 o 15:34 Samruddhi Dhake napísal(a): > Hello, > > I am creating my own trainneddata using tesseract v4.1.1 on Windows 10. > I am referring documentation > https://tesseract-ocr.github.io/tessdoc/tess4/Trai

[tesseract-ocr] Exploring the Possibility of training Tesseract to recognize handwritten source code.

2021-08-23 Thread Rajeev Kodippily
Hi Folks, I'm doing a study on handwritten source code recognition. My goal is to convert *handwritten source code* ( obtained using stylus input, or it could be a written image if that is better suited for tesseract) to typeset. I would like to explore any support tesseract may have for thi

Re: [tesseract-ocr] Quality of Fraktur OCR too bad, any mistake on my side?

2021-08-23 Thread Merlijn B.W. Wajer
Hi Andreas, Using a newer data file and a newer Tesseract might help - see inline. On 28/07/2021 18:17, Andreas Groß wrote: > I work on Kubuntu 20.04 with gImageReader 3.3.1 () and tesseract 4.1.1 > and had installed fracture model with this command > > sudo apt-get install tesseract-ocr-script-

Re: [tesseract-ocr] Quality of Fraktur OCR too bad, any mistake on my side?

2021-08-23 Thread Andreas Groß
you are right. A better data file does the job well. I had documented my solution for the problem here: https://forum.ubuntuusers.de/topic/qualitaet-der-fraktur-ocr-zu-schlecht-ein-fehl/ On Tuesday, August 24, 2021 at 6:21:22 AM UTC+2 Merlijn Wajer wrote: > Hi Andreas, > > Using a newer data fil

Re: [tesseract-ocr] text2image not found after running tesstrain.sh command

2021-08-23 Thread Samruddhi Dhake
I have generated training tools through source code. On Monday, August 23, 2021 at 7:09:02 PM UTC+5:30 zdenop wrote: > How did you install tesseract? Did you also install training tools? > > Zdenko > > > po 23. 8. 2021 o 15:34 Samruddhi Dhake napísal(a): > >> Hello, >> >> I am creating my own tr