When multiprocessing using V4 (and TessAPI), I had to make multiple copies of tessdata, and give each worker with a unique tessdata.
Now it works okay. Hope this is helpful. From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com] On Behalf Of ignas...@gmail.com Sent: 28 August 2018 14:40 To: tesseract-ocr <tesseract-ocr@googlegroups.com> Subject: [tesseract-ocr] Tesseract 3.x multiprocessing weird behaviour I am not sure whether it is my infrastructure that does this weird stuff or the tesseract-ocr itself. Whenever i use image_to_string in single-process environment - the tesseract-ocr works fine. But when I spawn multiple workers with gunicorn and all of them get to do some work with ocr reading - the tesseract-ocr starts reading very poorly (and not from performance-vise, but accuracy-vise). Even after the load is done - tesseract never has the same accuracy. I need to restart all the workers in order to get tesseract working well again. This is super weird. Maybe anyone has experienced or heard of this issue? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com<mailto:tesseract-ocr+unsubscr...@googlegroups.com>. To post to this group, send email to tesseract-ocr@googlegroups.com<mailto:tesseract-ocr@googlegroups.com>. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3b1859ad-5c26-4688-b5e6-ceb7ae984c8f%40googlegroups.com<https://groups.google.com/d/msgid/tesseract-ocr/3b1859ad-5c26-4688-b5e6-ceb7ae984c8f%40googlegroups.com?utm_medium=email&utm_source=footer>. For more options, visit https://groups.google.com/d/optout. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/420a8d01089e47a191e498c889eeada8%40eesm.com. For more options, visit https://groups.google.com/d/optout.