Re: [tesseract-ocr] lstmeval gives a perfect result but tesseract fails

2018-05-31 Thread Julien Jemine
Hi Shree, Thanks for your answer. If you don't mind, could you explain why it'd be better ? Le jeudi 31 mai 2018 17:25:47 UTC+2, shree a écrit : > > >I've trained a LSTM model for a custom language from scratch as explained > here >

Re: [tesseract-ocr] Not able install tesseract ocr on ubuntu 17.04

2018-05-31 Thread RT-Rakesh
Hi ShreeDevi, Thanks for your response. I am still getting this error when trying with the command that you shared. Please assist me how to go about here. Thank you very much. user@computer:~$ sudo apt install tesseract-ocr Reading package lists... Done Building dependency tree Readin

[tesseract-ocr] Re: Training error "Couldn't find a matching blob"

2018-05-31 Thread shree
This has been an issue for long. Thanks for finding the problem. Please submit a PR on github. On Friday, June 1, 2018 at 1:55:25 AM UTC+5:30, Paul Kitchen wrote: > > After a lot of stepping through tesseract code, I found the problem. > > 1) In file coutln.cpp, function C_OUTLINE::IsLegall

[tesseract-ocr] Re: Training error "Couldn't find a matching blob"

2018-05-31 Thread Paul Kitchen
After a lot of stepping through tesseract code, I found the problem. 1) In file coutln.cpp, function C_OUTLINE::IsLegallyNested(), we assign outer_area() to an inT32, parent_area. Then lower in the function, we multiple child->outer_area() by parent_area. This caused an integer overflow w

[tesseract-ocr] Preprocess Image

2018-05-31 Thread Hongguo An
Hi: When trying to OCR the above image, the date 09/02/2017 is always wrong, (0G/02/2017). This is tesseract 4 running on linux, the cmd line is: *tesser

[tesseract-ocr] Re: Not able install tesseract ocr on ubuntu 17.04

2018-05-31 Thread Александр Поздняков
You need to replace in */etc/apt/sources.list* repository *http://us.archive.ubuntu.com/ubuntu* at *http://old-releases.ubuntu.com/ubuntu/* sudo apt-get update sudo apt install tesseract-ocr As for the beta version, I'll think ... четверг, 31 мая 2018 г., 10:04:19 UTC+3 пользователь RT-Rakesh

[tesseract-ocr] Syriac traineddata

2018-05-31 Thread George Kiraz
The Syriac Institute [www.bethmardutho.org] is looking for a specialist for training/improving Syriac traineddata for Tesseract 4 to contribute to an OCR project. Anyone interested can contact us offline. Thanks. -- You received this message because you are subscribed to the Google Groups "te

Re: [tesseract-ocr] lstmeval gives a perfect result but tesseract fails

2018-05-31 Thread ShreeDevi Kumar
>I've trained a LSTM model for a custom language from scratch as explained here . >The language only has about 100 words and 17 characters, so it's pretty simple. For such a small model, try to build the legacy version rathe

Re: [tesseract-ocr] Help for training tesseract to recognize a new (dead) language

2018-05-31 Thread Ramast Magdy
Impressive! I thought we would need to do a lot of work in order to reach that stage??. The "??" in the text correspond to an unknown character to me, I also can't find it among the available unicode characters. It's certainly 100% not part of the text. Probably indicator of new chapter. Maybe

[tesseract-ocr] lstmeval gives a perfect result but tesseract fails

2018-05-31 Thread Julien Jemine
Hi, I've trained a LSTM model for a custom language from scratch as explained here . The language only has about 100 words and 17 characters, so it's pretty simple. When I run lstmeval on my model, I get a perfect match:

Re: [tesseract-ocr] Not able install tesseract ocr on ubuntu 17.04

2018-05-31 Thread ShreeDevi Kumar
Remove the existing version, then sudo add-apt-repository ppa:alex-p/tesseract-ocr sudo apt-get update sudo apt install tesseract-ocr ShreeDevi भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com On Thu, May 31, 2018 at 12:29 PM, RT

Re: [tesseract-ocr] Where to find tessdata folder?

2018-05-31 Thread Zdenko Podobny
Did you follow instruction for installation of that package? Did you try internet search before posting on forum? Did you try to search for help in project tesserocr??? I just put it to google and I got: https://pypi.org/project/tesserocr/ https://github.com/sirfz/tesserocr https://oded.blog/2017

[tesseract-ocr] Not able install tesseract ocr on ubuntu 17.04

2018-05-31 Thread RT-Rakesh
user@computer:~$ sudo apt install tesseract-ocr Reading package lists... Done Building dependency tree Reading state information... Done The following packages were automatically installed and are no longer required: libgnutls-openssl27 postfix-sqlite Use 'sudo apt autoremove' to remove t