You were right: I just found that my RPi is throttling, which explains the slowdown. I'm now checking whether a heatsink helps. So I expect there is nothing to tune in my loop. I will also see whether I can try a smaller model.
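For anyone who wants to check the same thing, here is a minimal sketch of how the throttling state can be read from Python. It assumes the `vcgencmd get_throttled` firmware tool is available on the Pi and that it prints a line such as `throttled=0x50005`. The helper names `decode_throttled` and `read_throttled` are made up for illustration and are not part of tesseract or tesserocr.

```python
import subprocess

# Bit positions reported by `vcgencmd get_throttled` on Raspberry Pi firmware.
THROTTLE_FLAGS = {
    0: "under-voltage detected",
    1: "ARM frequency capped",
    2: "currently throttled",
    3: "soft temperature limit active",
    16: "under-voltage has occurred",
    17: "ARM frequency capping has occurred",
    18: "throttling has occurred",
    19: "soft temperature limit has occurred",
}


def decode_throttled(output):
    """Turn a line such as 'throttled=0x50005' into a list of flag descriptions."""
    value = int(output.strip().split("=", 1)[1], 16)
    return [name for bit, name in sorted(THROTTLE_FLAGS.items())
            if value & (1 << bit)]


def read_throttled():
    """Query the firmware (hypothetical helper; only works on a Pi with vcgencmd)."""
    out = subprocess.check_output(["vcgencmd", "get_throttled"])
    return decode_throttled(out.decode("ascii"))
```

Calling `read_throttled()` every few iterations of the OCR loop and logging the result next to the OCR time should show whether the slow calls coincide with throttling. An empty list means no flag is set.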
best regards
Jan

On Wednesday, 30 January 2019 at 11:15:19 UTC+1, Lorenzo Blz wrote:
>
> Did you check this?
>
> https://www.raspberrypi.org/forums/viewtopic.php?f=63&t=147781&start=50#p972790
>
> On Wed, 30 Jan 2019 at 08:09, Jan Pohanka <[email protected]> wrote:
>
>> I have already done that but haven't found anything interesting.
>> I tried to ask here whether there are, e.g., any parts of the algorithms
>> that can be disabled. The image is preprocessed and binarized, and
>> contains only 8 digits (and a point). I was also a bit surprised that
>> resizing the image from 400px to 50px gave only a subtle speed-up.
>>
>> I will try the fast model today (if I find out how to switch to it);
>> maybe it will help.
>>
>> Here are my measured times:
>> ocr time: 0.980876922607
>> ocr time: 0.435426950455
>> ocr time: 0.76907491684
>> ocr time: 0.836761951447
>> ocr time: 0.871710062027
>> ocr time: 0.803520917892
>> ocr time: 0.371052026749
>> ocr time: 0.732284069061
>> ocr time: 0.745162010193
>> ocr time: 0.836426019669
>> ocr time: 0.740739107132
>> ocr time: 0.379159927368
>> ocr time: 0.798940181732
>> ocr time: 0.3972260952
>> ocr time: 0.739762067795
>> ocr time: 0.7757999897
>> ocr time: 0.772871017456
>> ocr time: 0.435608863831
>> ocr time: 0.770547866821
>> ocr time: 0.870738983154
>> ocr time: 0.37126493454
>> ocr time: 0.837875127792
>> ocr time: 0.811723947525
>> ocr time: 0.865257024765
>> ocr time: 0.79048204422
>> ocr time: 0.435704946518
>> ocr time: 0.763910055161
>> ocr time: 0.391008853912
>> ocr time: 0.396636009216
>> ocr time: 0.38174700737
>> ocr time: 0.809095144272
>> ocr time: 0.773195028305
>> ocr time: 0.427488088608
>> ocr time: 0.403608083725
>> ocr time: 0.806233167648
>> ocr time: 0.948635101318
>> ocr time: 0.900885105133
>> ocr time: 0.829130887985
>> ocr time: 0.932774782181
>> ocr time: 1.09788799286
>> ocr time: 0.520708799362
>> ocr time: 0.448786973953
>> ocr time: 0.560626983643
>> ocr time: 0.993177175522
>> ocr time: 0.48442697525
>> ocr time: 1.1292309761
>> ocr time: 1.04695606232
>> ocr time: 0.8810338974
>> ocr time: 1.10285806656
>> ocr time: 1.05213713646
>> ocr time: 1.22593903542
>> ocr time: 1.04618191719
>> ocr time: 1.11645102501
>> ocr time: 1.05435395241
>> ocr time: 1.15162396431
>> ocr time: 0.547721862793
>> ocr time: 0.607867956161
>> ocr time: 1.14074802399
>> ocr time: 1.1790971756
>> ocr time: 1.18815803528
>> ocr time: 0.58503985405
>> ocr time: 1.10898280144
>> ocr time: 1.22723913193
>> ocr time: 1.2178709507
>> ocr time: 1.28540086746
>> ocr time: 1.28237104416
>> ocr time: 1.56176805496
>> ocr time: 1.2859480381
>> ocr time: 1.2599170208
>> ocr time: 1.42588591576
>> ocr time: 1.51333785057
>> ocr time: 1.34276986122
>> ocr time: 1.34283900261
>> ocr time: 1.39351201057
>> ocr time: 1.61450195312
>> ocr time: 1.44723105431
>> ocr time: 1.63176107407
>> ocr time: 0.82429599762
>> ocr time: 1.08239603043
>> ocr time: 0.755813121796
>> ocr time: 1.63984704018
>> ocr time: 1.84553313255
>> ocr time: 0.958009958267
>> ocr time: 1.52479290962
>> ocr time: 0.919597864151
>>
>> thanks
>> Jan
>>
>> On Wednesday, 30 January 2019 at 7:57:07 UTC+1, zdenop wrote:
>>>
>>> search issue tracker for "speed"...
>>>
>>> Zdenko
>>>
>>> On Wed, 30 Jan 2019 at 7:51, Jan Pohanka <[email protected]> wrote:
>>>
>>>> It is 4.0. I'm satisfied with the recognition results, but I need to
>>>> make it faster (consistently below 1s)...
>>>>
>>>> On Wednesday, 30 January 2019 at 7:48:23 UTC+1, zdenop wrote:
>>>>>
>>>>> What is your tesseract version?
>>>>>
>>>>> Zdenko
>>>>>
>>>>> On Tue, 29 Jan 2019 at 20:40, Jan Pohanka <[email protected]> wrote:
>>>>>
>>>>>> Thanks for the suggestions. You are right that I'm referring to the
>>>>>> api.GetUTF8Text() call; it is my bottleneck.
>>>>>> I was not aware that there are fast and best models in tesseract
>>>>>> 4.0; I will give them a try. So far I have used just lang=eng or osd.
>>>>>> It seems suspicious to me that the calls get longer over time.
>>>>>> To be more precise, the first 10-15 calls take up to 500ms and
>>>>>> later ones rise above 1s...
>>>>>> Moving SetSourceResolution outside of the loop makes no difference,
>>>>>> unfortunately.
>>>>>>
>>>>>> BR
>>>>>> Jan
>>>>>>
>>>>>> On Tuesday, 29 January 2019 at 18:08:49 UTC+1, Lorenzo Blz wrote:
>>>>>>>
>>>>>>> First double check that the Pi is not throttling due to overheating
>>>>>>> or lack of USB power. This may cause the slowdown.
>>>>>>>
>>>>>>> Usually 30-50 px of text height is fine. If the problem is
>>>>>>> tesseract, try to use the fast model (or "normal" if using best).
>>>>>>> I assume you are using the 4.x release.
>>>>>>>
>>>>>>> Try tesseract -v to see if you are using all the available CPU
>>>>>>> optimizations.
>>>>>>>
>>>>>>> Try to move the SetSourceResolution call outside the loop and see
>>>>>>> if it changes something (maybe it invalidates some caches or
>>>>>>> something).
>>>>>>>
>>>>>>> The time you are referring to is one single api.GetUTF8Text() call,
>>>>>>> correct?
>>>>>>>
>>>>>>> Lorenzo
>>>>>>>
>>>>>>> On Tue, 29 Jan 2019 at 17:48, Jan Pohanka <[email protected]> wrote:
>>>>>>>
>>>>>>>> Hello,
>>>>>>>>
>>>>>>>> I'm making a simple device that recognizes numbers in pictures
>>>>>>>> taken by a webcam. Everything runs on a Raspberry Pi 3.
>>>>>>>> The whole thing is the following simple loop (in Python for
>>>>>>>> simplicity, but it is the same with the C++ API); the images are
>>>>>>>> preprocessed to black and white.
>>>>>>>>
>>>>>>>>     import tesserocr
>>>>>>>>     from tesserocr import PyTessBaseAPI
>>>>>>>>
>>>>>>>>     # images: the preprocessed black-and-white pictures
>>>>>>>>     api = PyTessBaseAPI(psm=tesserocr.PSM.SINGLE_WORD)
>>>>>>>>
>>>>>>>>     for im in images:
>>>>>>>>         api.SetImage(im)
>>>>>>>>         api.SetSourceResolution(70)
>>>>>>>>         ot = api.GetUTF8Text()
>>>>>>>>
>>>>>>>>     api.End()
>>>>>>>>
>>>>>>>> My problem is that the api.GetUTF8Text() call is quite slow and,
>>>>>>>> moreover, it gets slower and slower over time. Are there any
>>>>>>>> options to make recognition faster? I have tried resizing the
>>>>>>>> image to around 50x10px. The times start at around 300ms but then
>>>>>>>> go up to above 1s, which is too slow for me. I tried both the
>>>>>>>> legacy and LSTM algorithms, but they are similar.
>>>>>>>>
>>>>>>>> best regards
>>>>>>>> Jan
>>>>>>>>
>>>>>>>> --
>>>>>>>> You received this message because you are subscribed to the Google
>>>>>>>> Groups "tesseract-ocr" group.
>>>>>>>> To unsubscribe from this group and stop receiving emails from it,
>>>>>>>> send an email to [email protected].
>>>>>>>> To post to this group, send email to [email protected].
>>>>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>>>>> To view this discussion on the web visit
>>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/a53b4b25-97e3-47dc-823a-cbb219225eed%40googlegroups.com
>>>>>>>> For more options, visit https://groups.google.com/d/optout.
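The "getting slower over time" observation from the thread can be put into a number rather than read off a long list of timings. The following is a rough sketch, assuming Python 3 (for `time.perf_counter`). The names `time_calls` and `drift_ratio` are hypothetical helpers, and `fn` stands in for whatever wraps the `api.SetImage()` / `api.GetUTF8Text()` pair.

```python
import time


def time_calls(fn, items):
    """Call fn(item) for each item and return each call's duration in seconds."""
    durations = []
    for item in items:
        start = time.perf_counter()
        fn(item)
        durations.append(time.perf_counter() - start)
    return durations


def drift_ratio(durations, window=15):
    """Mean of the last `window` durations divided by the mean of the first `window`.

    A value well above 1.0 means the later calls are slower than the early ones.
    """
    if len(durations) < 2 * window:
        raise ValueError("need at least 2 * window measurements")
    head = durations[:window]
    tail = durations[-window:]
    return (sum(tail) / window) / (sum(head) / window)
```

With the loop from the original post, `fn` could be a small function that calls `api.SetImage(im)` and returns `api.GetUTF8Text()`. A ratio that stays near 1.0 on a cooled board but climbs on a hot one would point at throttling rather than at tesseract.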

