[tesseract-ocr] Tesseract 3.x multiprocessing weird behaviour

2018-08-27 Thread ignasdov
I am not sure whether my infrastructure or tesseract-ocr itself is causing this odd behaviour. Whenever I use image_to_string in a single-process environment, tesseract-ocr works fine. But when I spawn multiple workers with gunicorn and all of them get to do some work with ocr re

Re: [tesseract-ocr] IF I could make .unicharset by box/tif pairs instead of fonts files by tesstrain.sh?

2018-08-27 Thread Shree Devi Kumar
When using tesstrain.sh, you can add --save_box_tiff to the command line. The original tesstrain.sh did not move the box/tiff files along with the lstmf files (they remained in the /tmp directory). I first modified it to move the box/tiff files in all cases along with the lstmf files. This option now gives the user the choice w
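
For illustration, here is a rough sketch of how such a command line might be put together from Python. Only the --save_box_tiff flag comes from the message above. The helper name build_tesstrain_command, the other flags chosen (--lang, --linedata_only, --langdata_dir, --tessdata_dir, --output_dir) and all paths are assumptions picked for the example, so check them against your own copy of tesstrain.sh before relying on them.

```python
import shlex


def build_tesstrain_command(lang, langdata_dir, tessdata_dir, output_dir,
                            save_box_tiff=True):
    """Return an argv list for tesstrain.sh (hypothetical helper).

    Every path passed in is a placeholder supplied by the caller.
    """
    cmd = [
        "tesstrain.sh",
        "--lang", lang,
        "--linedata_only",
        "--langdata_dir", langdata_dir,
        "--tessdata_dir", tessdata_dir,
        "--output_dir", output_dir,
    ]
    if save_box_tiff:
        # The flag discussed in the thread: keep box/tiff pairs with the lstmf files.
        cmd.append("--save_box_tiff")
    return cmd


if __name__ == "__main__":
    # Placeholder values; this only prints the command, it does not run it.
    argv = build_tesstrain_command("eng", "./langdata", "./tessdata", "./train_out")
    print(shlex.join(argv))
```

The list form can be handed to subprocess.run as is, which avoids shell quoting problems when a directory name contains spaces.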

[tesseract-ocr] IF I could make .unicharset by box/tif pairs instead of fonts files by tesstrain.sh?

2018-08-27 Thread 王思远
I see there is a new flag in tesseract's src/training/tesstrain.sh in the change on 2018/8/20: add variable --save_b