[tesseract-ocr] Re: Tesseract couldn't load any languages!

2018-05-17 Thread shree
You may not have downloaded eng.traineddata, or it may be in a different location. Try running tesseract on the command line with --list-langs to check which languages it can find. On Friday, May 18, 2018 at 9:27:59 AM UTC+5:30, Dattatraya Tembare wrote: > > > *[SOLVED] changed the language from 'hin+eng' to 'hin' In this c
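The check suggested above can also be done by looking for the .traineddata files directly. This is a minimal sketch, not part of the original thread: the helper name `missing_languages` is hypothetical, and it assumes each language code in a `lang` string such as 'hin+eng' maps to a file named `<code>.traineddata` in a single tessdata directory that you pass in.

```python
import os


def missing_languages(lang, tessdata_dir):
    """Return the codes in `lang` that have no .traineddata file in tessdata_dir.

    `lang` uses the same 'hin+eng' form that is passed to tesseract.
    Hypothetical helper for illustration; it only checks that files exist.
    """
    codes = [code for code in lang.split("+") if code]
    return [
        code
        for code in codes
        if not os.path.isfile(os.path.join(tessdata_dir, code + ".traineddata"))
    ]
```

Calling `missing_languages("hin+eng", path_to_tessdata)` with your own tessdata path lists any language whose data file is absent; an empty list means every requested file is present, so the problem probably lies elsewhere (for example, in which directory tesseract is actually searching).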

[tesseract-ocr] Re: Tesseract couldn't load any languages!

2018-05-17 Thread Dattatraya Tembare
*[SOLVED] changed the language from 'hin+eng' to 'hin'. In this case the selection of language also matters -* I was processing an image with lang=hin+eng, but it gave the same error (mentioned in this post). As there was little English text in the image, I changed to lang=hin and got the expected resu

[tesseract-ocr] Re: Tesseract couldn't load any languages!

2018-05-17 Thread Dattatraya Tembare
Thanks! Your solution worked. Now I am facing a different problem: with the same pattern, 33 files were processed successfully, but the 34th file failed. java.lang.Error: Invalid memory access at com.sun.jna.Native.invokePointer(Native Method) ~[jna-4.5.1.jar:4.5.1 ( b0)] at com.sun.jna.Function.invokePointer(Functi

Re: [tesseract-ocr] Re: How can JTessBoxEditor generate lstm files ?

2018-05-17 Thread Quan Nguyen
Those .sh shell scripts will not run in a Windows environment. You may need Cygwin or Windows Subsystem for Linux. Hopefully others who have experience with this will chime in. On Thursday, May 17, 2018 at 2:35:50 AM UTC-5, Fadi Fawzi wrote: > > Thanks Quan > But is there a simple way to do training

Re: [tesseract-ocr] tesseract version - Ubuntu 16.04 PPA vs compiling from tesseract-ocr github source (master-branch)

2018-05-17 Thread ShreeDevi Kumar
> Which traineddata (english) is installed when tesseract is installed using the Ubuntu PPA tessdata_fast > Is the Ubuntu PPA version in sync with the GitHub master branch? Not necessarily, but it should be pretty close. You can look at the commit number and date in the files at the PPA. > Whi

[tesseract-ocr] tesseract version - Ubuntu 16.04 PPA vs compiling from tesseract-ocr github source (master-branch)

2018-05-17 Thread Pushkar Pandey
Hi all, could someone answer the following questions? 1. Is the Ubuntu 16.04 PPA the latest tesseract version, straight from the GitHub master branch? Is the Ubuntu PPA version in sync with the GitHub master branch? 2. Which traineddata (english) is installed when tesseract is installed usin

Re: [tesseract-ocr] Re: How can JTessBoxEditor generate lstm files ?

2018-05-17 Thread Fadi Fawzi
Thanks Quan. But is there a simple way to do the training process on Windows, or must I stick to Linux (Ubuntu)? On Tue, May 15, 2018 at 5:02 AM, Quan Nguyen wrote: > As of today, it supports only legacy training (i.e., 3.0x version). > > Training for 4.0x is described in the Training Wiki >