[tesseract-ocr] Re: Read Local Charter (Hindi , Tamil, Sinhala)

2018-03-02 Thread Aruna Gamage
I used this repo: https://github.com/rmtheis/android-ocr and I tried to change the traineddata file to sin.traineddata, but it does not recognize the Sinhala font. Please help me configure my project. Thanks, Aruna -- You received this message because you are subscribed to the Google Group

Re: [tesseract-ocr] Re: Read Local Charter (Hindi , Tamil, Sinhala)

2018-03-02 Thread ShreeDevi Kumar
Please post this issue in the appropriate repository, i.e. https://github.com/rmtheis/android-ocr. Sinhala can be recognized using the latest version of Tesseract and the traineddata from the tessdata_fast repo. Please close this issue. ShreeDevi
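
For the desktop command-line tool (not the Android project linked above), the advice can be sketched roughly as follows. Tesseract looks for language data in a file named <lang>.traineddata, so a misnamed file is simply not found. The helper names, the image name and the output base below are hypothetical; only the `-l` and `--tessdata-dir` options are taken from the tesseract CLI. This is a minimal sketch, not the method used by android-ocr.

```python
from pathlib import Path


def traineddata_path(tessdata_dir, lang):
    """Return the file name Tesseract expects: <tessdata_dir>/<lang>.traineddata."""
    return Path(tessdata_dir) / f"{lang}.traineddata"


def has_traineddata(tessdata_dir, lang):
    """True only if the correctly named language file exists in tessdata_dir."""
    return traineddata_path(tessdata_dir, lang).is_file()


def build_tesseract_command(image, output_base, lang="sin", tessdata_dir=None):
    """Build the argument list for the tesseract CLI (hypothetical helper).

    The result can be passed to subprocess.run() on a machine where
    tesseract is installed.
    """
    cmd = ["tesseract", str(image), str(output_base), "-l", lang]
    if tessdata_dir is not None:
        # Point Tesseract at the directory holding the downloaded traineddata.
        cmd += ["--tessdata-dir", str(tessdata_dir)]
    return cmd
```

Checking has_traineddata(dir, "sin") before running OCR catches the common case where the file was saved under a slightly different name.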

[tesseract-ocr] Change language Sinhala

2018-03-02 Thread Aruna Gamage
Dear Sir, I need to detect the Sinhala language. I changed the traineddata file to sin.traineddata, but Sinhala characters are not detected. I am using this repo: https://github.com/rmtheis/android-ocr. Please tell me what I haven't edited in that project. Regards, Aruna

[tesseract-ocr] Has anybody use tesseract to train and recognize handwritten ?

2018-03-02 Thread Wayne
Has anybody used Tesseract to train on and recognize handwriting? I trained Tesseract on handwriting with the MNIST database, but I get poor accuracy. How can I improve it?

[tesseract-ocr] Tesseract

2018-03-02 Thread Alexandre Juca
Do the Google Vision APIs use Tesseract internally? Are there any documents that can help me understand how the Vision API does OCR internally?

[tesseract-ocr] tesseract data files

2018-03-02 Thread Simon Eigeldinger
Hi all, I just looked at the git commits for Tesseract and read that there have been changes to the OCR modes. Are the 3 tessdata sets still valid? tessdata_fast and tessdata_best have been updated, so I guess those reflect the latest developments, but tessdata hasn't had an update since September. i

Re: [tesseract-ocr] tesseract data files

2018-03-02 Thread ShreeDevi Kumar
Hi Simon, If you are planning to package 4.00alpha from the master branch, please use the traineddata files from tessdata_fast. These are the files that have been shipped for Ubuntu 18.04 and included in Debian. See https://github.com/tesseract-ocr/tesseract/wiki for some links. You can update the

Re: [tesseract-ocr] tesseract data files

2018-03-02 Thread ShreeDevi Kumar
The tessdata repo supports both --oem 0 and --oem 1, but the files are older and may NOT be fully compatible with the current code. The results may vary depending on the language and OEM used. I have NOT tested this much, since the newer traineddata files give better accuracy for Indian languages. ShreeDevi
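
The relationship between engine mode and data set described above can be summarized in a small lookup. This is only a sketch: the entry for tessdata follows the message, while treating tessdata_fast and tessdata_best as LSTM-only (--oem 1) is an assumption based on those repositories' own descriptions, and the names SUPPORTED_OEMS and repos_for_oem are hypothetical.

```python
# Engine modes passed to --oem: 0 = legacy engine, 1 = LSTM engine.
LEGACY, LSTM = 0, 1

# Engine modes each traineddata repository carries models for.
# tessdata: both modes (per the message above).
# tessdata_fast / tessdata_best: LSTM only (assumption, see lead-in).
SUPPORTED_OEMS = {
    "tessdata": {LEGACY, LSTM},
    "tessdata_fast": {LSTM},
    "tessdata_best": {LSTM},
}


def repos_for_oem(oem):
    """Return the repository names whose files can be used with this --oem value."""
    return sorted(name for name, oems in SUPPORTED_OEMS.items() if oem in oems)
```

Under these assumptions, repos_for_oem(0) leaves only tessdata, which is why a package that needs the legacy engine cannot rely on tessdata_fast alone.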