Re: [tesseract-ocr] Re: Trying to add chars to tesseract 4.0

2017-12-15 Thread ShreeDevi Kumar
>> Thanks, I have read that the new tesseract-ocr 4.0 doesn't use a wordlist anymore. Is it meant for the older version? Is that right? The new 4.0alpha version does not REQUIRE the wordlist, but uses it if available, and accuracy improves with the wordlist. So, basically, 4.0alpha will work without wor

Re: [tesseract-ocr] Re: Trying to add chars to tesseract 4.0

2017-12-15 Thread Fahad Al-Saidi
Thanks, I have read that the new tesseract-ocr 4.0 doesn't use a wordlist anymore. Is it meant for the older version? Is that right? On Fri, Dec 15, 2017 at 12:13 PM, shree wrote: > > > On Friday, December 8, 2017 at 5:46:01 PM UTC+5:30, Fahad Al-Saidi wrote: >> >> >> I have the same problem, why not the new

[tesseract-ocr] Re: Trying to add chars to tesseract 4.0

2017-12-15 Thread shree
On Friday, December 8, 2017 at 5:46:01 PM UTC+5:30, Fahad Al-Saidi wrote: > > > I have the same problem: why doesn't the new fine-tuned traineddata include > the old wordlist? It is supposed to do so. I followed the instructions in the > wiki but I got the same issue. Any help? > If you want the wordl

Re: [tesseract-ocr] Re: Trying to add chars to tesseract 4.0

2017-12-08 Thread Fahad Al-Saidi
Great, then how do I combine the wordlist into the new traineddata? The wiki page isn't clear about that. On Fri, Dec 8, 2017 at 4:29 PM, ShreeDevi Kumar wrote: > The langdata repository has not been updated by Ray for 4.0alpha. If you > want the same word list, unpack the traineddata from tessdata

Re: [tesseract-ocr] Re: Trying to add chars to tesseract 4.0

2017-12-08 Thread ShreeDevi Kumar
The langdata repository has not been updated by Ray for 4.0alpha. If you want the same word list, unpack the traineddata from the tessdata repositories. Also read the last section of the training wiki page on combining files. On 08-Dec-2017 5:45 PM, "Fahad Al-Saidi" wrote: > > On Wednesday, December 6,
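
The unpack-and-recombine advice above can be sketched as a sequence of command lines. This is a minimal sketch, not the procedure from the wiki: it assumes the standard Tesseract tools combine_tessdata, dawg2wordlist and wordlist2dawg are installed, and that the LSTM components are named <lang>.lstm-unicharset and <lang>.lstm-word-dawg, which may differ in a given 4.0alpha build. The function name, the file paths and the "old."/"new." prefixes are hypothetical. The Python code only builds the commands so they can be reviewed before running anything.

```python
def wordlist_transfer_commands(old_traineddata, new_traineddata, workdir, lang="eng"):
    """Build (but do not run) the commands that carry a word list from an
    existing traineddata file into a fine-tuned one.

    The prefixes and component names below are assumptions for illustration;
    check them against what `combine_tessdata -u` actually writes out.
    """
    old_prefix = f"{workdir}/old.{lang}."   # hypothetical unpack prefix
    new_prefix = f"{workdir}/new.{lang}."   # hypothetical unpack prefix
    wordlist = f"{workdir}/{lang}.wordlist.txt"
    return [
        # 1. Unpack the original traineddata from the tessdata repository.
        ["combine_tessdata", "-u", old_traineddata, old_prefix],
        # 2. Convert its word DAWG back into a plain-text word list.
        ["dawg2wordlist", old_prefix + "lstm-unicharset",
         old_prefix + "lstm-word-dawg", wordlist],
        # 3. Unpack the fine-tuned traineddata to get its (possibly larger) unicharset.
        ["combine_tessdata", "-u", new_traineddata, new_prefix],
        # 4. Rebuild the DAWG against the fine-tuned model's unicharset.
        ["wordlist2dawg", wordlist, new_prefix + "lstm-word-dawg",
         new_prefix + "lstm-unicharset"],
        # 5. Write the rebuilt DAWG back into the fine-tuned traineddata.
        ["combine_tessdata", "-o", new_traineddata, new_prefix + "lstm-word-dawg"],
    ]


if __name__ == "__main__":
    # Print the commands for review; run them by hand or via subprocess.run.
    for cmd in wordlist_transfer_commands("tessdata/eng.traineddata",
                                          "output/eng.traineddata", "tmp"):
        print(" ".join(cmd))
```

Rebuilding the DAWG in step 4, rather than copying the old one across, is a design choice for the case where new characters were added: a DAWG is encoded against a specific unicharset, so the old file may not match the fine-tuned model.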

[tesseract-ocr] Re: Trying to add chars to tesseract 4.0

2017-12-08 Thread Fahad Al-Saidi
On Wednesday, December 6, 2017 at 10:37:59 AM UTC+4, J Klein wrote: > > But this eng.traineddata was 5MB when the original one was 15.4MB. > I have the same problem: why doesn't the new fine-tuned traineddata include the old wordlist? It is supposed to do so. I followed the instructions in the wiki bu

[tesseract-ocr] Re: Trying to add chars to tesseract 4.0

2017-12-07 Thread J Klein
As an addendum, is there an easy way to diagnose why my eng.traineddata won't load? All I have is a -1 error from API Init3. I put it here: https://filebin.ca/3jvP3FKuvp4G/eng.traineddata in case anyone knows how to diagnose a bad eng.traineddata. Thanks in advance for any tips! On Wedn