On Thu, Mar 1, 2018 at 9:21 AM, 이경준 <playerjune...@gmail.com> wrote:
> Thank U reply my question. > > But my system is operated by Ubuntu 16.04. 03 LTS > > I think that that path is not working ? Am I false? > > > 2018년 2월 28일 수요일 오후 6시 18분 41초 UTC+9, shree 님의 말: >> >> Try with following - make sure that you change all variables with dir to >> match your setup >> >> tesstrain.sh \ >> --lang kor \ >> --noextract_font_properties \ >> --linedata_only \ >> * --langdata_dir ../langdata \* >> * --tessdata_dir ../tessdata \* >> * --fonts_dir **/mnt/c/Windows/Fonts** \* >> --fontlist \ >> "Arial Unicode MS" \ >> * --output_dir ../tesstutorial/kor* >> >> The fontlist you specify in command will override the list in >> language_specific.sh >> >> >> Tesseract4.00alpha gives good results for Korean recognition. Have you tried that? You may not need to do training. If you want to do training for 4.00, you need files from langdata and tessdata_best. https://github.com/tesseract-ocr/langdata https://github.com/tesseract-ocr/ tessdata_best see https://github.com/tesseract-ocr/langdata/blob/master/README.md -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduX5UAgj5Y2MLsgK_aL4S5Xw-QEOpw3xC9xDPMEWuDtS-g%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.