Re: [tesseract-ocr] Android app using Tesseract v4 for OCR

2019-03-30 Thread Soumik Ranjan Dasgupta
Please update if you complete it and decide to upload it to the Play Store. On Sat, Mar 30, 2019 at 11:58 AM prerna kumari wrote: > TextFairy is there and I'm also working on an android application using > tesseract. > > On Sat 30 Mar, 2019, 11:47 AM Soumik Ranjan Dasgupta, < > srd1...@cse.jgec.ac.in> w

Re: [tesseract-ocr] Training a language not in tesseract but almost similar script/ letters with vietnam language

2019-03-30 Thread haruo195k
Thank you for the response. I tried keeping the bazaar at the end, and the command runs without any error. However, tesseract is still not able to recognize the extra letters that I have provided in *tessedit_char_whitelist*; the output is the same. The words/text in the image is already

Re: [tesseract-ocr] Trainning tesseract for a new language from scratch that does not exist in Tesseract

2019-03-30 Thread haruo195k
Hi, you might have confused this with my other question. I am actually working on two languages. Neither of them is currently present in Tesseract. While one of them has a script/letters somewhat similar to vie, this one has no connection with, and is totally different from, any of the languages currentl

Re: [tesseract-ocr] Trainning tesseract for a new language from scratch that does not exist in Tesseract

2019-03-30 Thread Shree Devi Kumar
jTessBoxEditor offers Tesseract training for version 3.0x, which is why I mentioned it. For Tesseract 4, the training steps are very different. On Sat, Mar 30, 2019 at 1:14 PM wrote: > Hi, you might have got confused with my other question. I am actually > working on two languages. Neither of them are

[tesseract-ocr] How do I increase the accuracy in this situation

2019-03-30 Thread 童虎
I have an image like this: [image: 111.jpg] And I then run tesseract 111.jpg out -l chi_sim; cat out.txt;rm out.txt; But the result is Tesseract Open Source OCR Engine v4.0.0-332-gb727 with Leptonica E I have no idea how to improve this. Any ideas? Thank you. -- You received this message be

[tesseract-ocr] What is the current stable version of tesseract? And how to upgrade it from tesseract 3.04.01?

2019-03-30 Thread haruo195k
I have installed tesseract using the following command: *sudo apt install tesseract-ocr* on Ubuntu 16.04 LTS. I am working with Python 2.7 and 3.5. The current Tesseract version is shown as: tesseract --version *tesseract 3.04.01 leptonica-1.73 libgif 5.1.2 : libjpeg 8d (libjpeg-turbo 1.4.2) :

Re: [tesseract-ocr] What is the current stable version of tesseract? And how to upgrade it from tesseract 3.04.01?

2019-03-30 Thread Shree Devi Kumar
You can use Alex's PPA https://launchpad.net/~alex-p/+archive/ubuntu/tesseract-ocr?field.series_filter=xenial On Sat, Mar 30, 2019 at 4:43 PM wrote: > I have installed tesseract using the following command: > >*sudo apt install tesseract-ocr* > > on Ubuntu 16.04 LTS. Working with python 2

Re: [tesseract-ocr] How do I increase the accuracy in this situation

2019-03-30 Thread Zdenko Podobny
https://github.com/tesseract-ocr/tesseract/wiki/ImproveQuality Zdenko On Sat, Mar 30, 2019 at 9:50, 童虎 wrote: > I have a image like this: > > [image: 111.jpg] > And I then run > tesseract 111.jpg out -l chi_sim; cat out.txt;rm out.txt; > But the result is > > > Tesseract Open Source OCR Engine v4

Re: [tesseract-ocr] How to restrict OCR character set.

2019-03-30 Thread Martin Emmerson
Thanks! This may still be a stretch for my current level of tesseract knowledge but definitely more within reach! I look forward to giving it a try. On Friday, March 29, 2019 at 11:12:44 PM UTC-7, shree wrote: > > This was finetuned with 20+ monospaced fonts for 400 iterations to error > ra

[tesseract-ocr] Tesseract detecting 6 as 5.

2019-03-30 Thread steve
[image: Frame234PixImageBinary.jpg] This seems like a pretty good 6 even though a little ragged looking. Why does tesseract detect this as a 5? What type of processing should be done to help the situation? I am using version 3.02.02 Windows 32 bit build. -- You received this message because

Re: [tesseract-ocr] Tesseract detecting 6 as 5.

2019-03-30 Thread Zdenko Podobny
Tesseract 3.02 is a very, very old version. Try to upgrade. Zdenko On Sat, Mar 30, 2019 at 19:31 wrote: > [image: Frame234PixImageBinary.jpg] > > This seems like a pretty good 6 even though a little ragged looking. Why > does tesseract detect this as a 5? What type of processing should be don

Re: [tesseract-ocr] Trainning tesseract for a new language from scratch that does not exist in Tesseract

2019-03-30 Thread haruo195k
Ok. thanks. Could you guide me on how to train in tesseract 4? On Saturday, March 30, 2019 at 1:28:19 PM UTC+5:30, shree wrote: > > jtessboxeditor offers tesseract training for version 3.0x that's why I > mentioned it. > > For tesseract4, training steps are very different. > > On Sat, Mar 30,

[tesseract-ocr] Tesseract not performing well even after apply Binarisation and create border of 10 pixel

2019-03-30 Thread Abhay Garg
I am using Tesseract 4 and it's working fine, but in some cases it is not performing well. Example: Original Image [image: roi_o1.png] Image After applying Binarisation and 10 pix
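
For readers unfamiliar with the two preprocessing steps named in this thread (binarisation and a 10-pixel border), here is a minimal sketch in plain Python. It is illustrative only: the post does not say which binarisation method or tool was used, so the fixed global threshold of 128, the white border colour, and the helper names `binarize` and `add_border` are all assumptions. A real pipeline would more likely use an imaging library and an adaptive threshold.

```python
# Hypothetical helpers; not taken from the original post.

def binarize(gray, threshold=128):
    """Map each grayscale pixel (0-255) to 0 (black) or 255 (white).

    A fixed global threshold is assumed here for simplicity.
    """
    return [[255 if px >= threshold else 0 for px in row] for row in gray]


def add_border(img, width=10, fill=255):
    """Surround the image with `width` pixels of `fill` on every side."""
    cols = len(img[0]) if img else 0
    side = [fill] * width
    # Pad each existing row on the left and right.
    body = [side + list(row) + side for row in img]
    # Build fresh full-width rows for the top and bottom margins.
    top = [[fill] * (cols + 2 * width) for _ in range(width)]
    bottom = [[fill] * (cols + 2 * width) for _ in range(width)]
    return top + body + bottom


# A tiny 2x2 grayscale "image" stored as nested lists.
gray = [[30, 200], [220, 10]]
padded = add_border(binarize(gray), width=10)
print(len(padded), len(padded[0]))  # 22 22
```

The border is filled with white on the assumption that the text is dark on a light background; for inverted images the fill value would need to change.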