[tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread 이경준
Sorry, but I have an issue with Korean. The answer you mentioned applies to English, but it doesn't work for Korean. The logs show a font error. I referred to /training/language-specific.sh (vi language-specific.sh), Font list - kor _NeoLatin, so I installed Korean fonts there and rebooted, but

Re: [tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread ShreeDevi Kumar
Try the following, and make sure that you change all variables with dir to match your setup:
    tesstrain.sh \
      --lang kor \
      --noextract_font_properties \
      --linedata_only \
      --langdata_dir ../langdata \
      --tessdata_dir ../tessdata \
      --fonts_dir /mnt/c/Windows/Fonts \
      --fontlist \
      "Arial

[tesseract-ocr] Differentiate "I" and "|" in Tesseract.

2018-02-28 Thread adarsh
I would like help training Tesseract to differentiate similar characters such as "I", "l" and "|". There are recognition errors in some places in the PDF. I hope someone can help. Thanks in advance. Adarsh

[tesseract-ocr] Re: Differentiate "I" and "|" in Tesseract.

2018-02-28 Thread adarsh
Images are attached for further reference. On Wednesday, February 28, 2018 at 5:00:15 PM UTC+5:30, ada...@turningcloud.com wrote:
> I want help to train my tess

[tesseract-ocr] Training Tesseract, results unexpected

2018-02-28 Thread Alex Ortega Muñoz
Hi, I'm trying to use Tesseract to recognise some characters from medical images, and I used JtessBoxEditor to edit the box file and correct the errors for this image. But when I run tesseract after modifying that box file with the correct characters, it keeps producing output with the previous box values. I don't know

Re: [tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread 이경준
Thank you for replying to my question. But my system runs Ubuntu 16.04.03 LTS, so I think that path will not work. Am I wrong? On Wednesday, February 28, 2018 at 6:18:41 PM UTC+9, shree wrote:
> Try with following - make sure that you change all variables with dir to match your setup
> tesstrain.s
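The advice to change every directory variable to match the local setup can be checked before running the training script. The following is a minimal sketch, not part of the original thread: `first_existing_dir` is a hypothetical helper, `/mnt/c/Windows/Fonts` is the path from the quoted command, and `/usr/share/fonts` is assumed here as a typical Ubuntu font location that may differ on a given machine.

```python
from pathlib import Path
from typing import Iterable, Optional


def first_existing_dir(candidates: Iterable[str]) -> Optional[Path]:
    """Return the first candidate that exists as a directory, or None."""
    for candidate in candidates:
        path = Path(candidate)
        if path.is_dir():
            return path
    return None


# Candidate font directories, in order of preference.
# The first is the path from the quoted command; the second is an
# assumption about a typical Ubuntu installation.
CANDIDATE_FONT_DIRS = ["/mnt/c/Windows/Fonts", "/usr/share/fonts"]

fonts_dir = first_existing_dir(CANDIDATE_FONT_DIRS)
if fonts_dir is None:
    print("No candidate font directory found; set --fonts_dir by hand.")
else:
    print(f"Pass this to --fonts_dir: {fonts_dir}")
```

Whichever directory is chosen still has to contain the fonts named in the font list, which this sketch does not check.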

Re: [tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread ShreeDevi Kumar
On Thu, Mar 1, 2018 at 9:21 AM, 이경준 wrote:
> Thank U reply my question.
> But my system is operated by Ubuntu 16.04. 03 LTS
> I think that that path is not working ? Am I false?
> On Wednesday, February 28, 2018 at 6:18:41 PM UTC+9, shree wrote:
>> Try with following - make sure that you change all v

[tesseract-ocr] I have a question about making a traineddata (tesseract 4.0 LSTM)

2018-02-28 Thread 이경준
Hi, I have a question about making a traineddata file (tesseract 4.0 LSTM). Tutorial Guide to lstmtraining: Creating Starter Traineddata. NOTE: This is a new step! Instead of a unicharset and script_

[tesseract-ocr] I have a question about font_properties (tesseract 4.0)

2018-02-28 Thread 이경준
Hi, I have a question about font_properties (tesseract 4.0): https://github.com/tesseract-ocr/langdata/blob/master/font_properties (e.g.) Baekmuk_Dotum 0 0 0 0 0 Is my understanding of what the digits here mean right? I quote these words from https://github.com/tesseract-ocr/tesseract/wiki/Training-Tesserac
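For reference, the Tesseract 3.x training documentation describes each font_properties entry as a font name followed by five 0/1 flags in the order italic, bold, fixed, serif, fraktur. The sketch below reads the example line from the question under that layout. The names `FontProperties` and `parse_font_properties_line` are hypothetical, introduced only for illustration, and are not part of Tesseract.

```python
from typing import NamedTuple


class FontProperties(NamedTuple):
    """One font_properties entry: a font name plus five boolean flags."""
    name: str
    italic: bool
    bold: bool
    fixed: bool
    serif: bool
    fraktur: bool


def parse_font_properties_line(line: str) -> FontProperties:
    """Parse '<fontname> <italic> <bold> <fixed> <serif> <fraktur>'."""
    parts = line.split()
    if len(parts) != 6:
        raise ValueError(f"expected a name and 5 flags, got: {line!r}")
    name, *flags = parts
    if any(flag not in ("0", "1") for flag in flags):
        raise ValueError(f"flags must be 0 or 1, got: {flags}")
    return FontProperties(name, *(flag == "1" for flag in flags))


# The example entry from the question: every flag is 0, so the font is
# declared as not italic, not bold, not fixed-pitch, not serif, not fraktur.
entry = parse_font_properties_line("Baekmuk_Dotum 0 0 0 0 0")
print(entry)
```

Font names in the file contain no spaces (spaces are written as underscores, as in Baekmuk_Dotum), which is why splitting on whitespace is enough here.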

Re: [tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread 이경준
Yes, I tried tessdata - kor.traineddata, but it is not good enough. Sorry. ㅜㅜ I cannot use the tesseract 4.0 tessdata kor.traineddata in business, so I must train 4.00 kor. Thank you for the advice. On Thursday, March 1, 2018 at 12:59:31 PM UTC+9, shree wrote:
> On Thu, Mar 1, 2018 at 9:21 AM, 이경준 >

Re: [tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread ShreeDevi Kumar
> my system is operated by Ubuntu 16.04. 03 LTS
> Yes .I tried tessdata - kor.trainnedata /// But it is not good enough. sorry .ㅜㅜ i can not use tesseract 4.0 tessdata-kor.trainnedata. in bussiness ..
I suggest that you uninstall your old tesseract version (3.0x):
    sudo apt-get remove tesser

Re: [tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread 이경준
Thank you for the advice. I have never installed tesseract (3.0x). I have a question: does your last command line install the language pack (kor.traineddata) into the tessdata directory? Am I wrong? I want to say that I already use it that way, but the recognition rate on my test images is not good enough for business use

Re: [tesseract-ocr] Re: I'm reading Using tesstrain (tesseract 4.0) wiki passage _ I have a question

2018-02-28 Thread ShreeDevi Kumar
> we don't understand each otehr saying.
Sorry about that. Please run the following commands and let me know the result:
    tesseract -v
    tesseract --list-langs
    combine_tessdata -u kor.traineddata
I do not know Korean, but feedback from other users has been that tesseract4 and the latest traineda