I can already generate the .box files from PNG images using listbox, but I don't understand what comes next. How can I use them to improve the best model, eng.traineddata?
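
For anyone reading this later with the same question: the usual next step after the .box files is fine-tuning, sketched roughly below. This is only a sketch under assumptions, not a confirmed recipe for this case. It assumes the Tesseract 4 training tools (`tesseract`, `combine_tessdata`, `lstmtraining`) are installed, that eng.traineddata comes from tessdata_best, that each PNG has its .box file next to it in a format the LSTM trainer accepts, and that `--psm 6` suits the images. The function name, `train.list`, the `finetuned` prefix and the `eng_finetuned` output name are made up for illustration. The code only builds the command lines, it does not run them.

```python
from pathlib import PurePosixPath


def finetune_commands(image_stems, base_traineddata, work_dir, max_iterations=400):
    """Build (but do not run) the commands for one fine-tuning pass.

    image_stems: paths without extension, e.g. "img/a" for img/a.png + img/a.box.
    All file names chosen below are illustrative assumptions.
    """
    base = PurePosixPath(base_traineddata)
    work = PurePosixPath(work_dir)
    lang = base.stem                                # "eng" for eng.traineddata
    extracted = work / f"{lang}.lstm"               # LSTM weights pulled out of the base model
    list_file = work / "train.list"                 # text file listing the .lstmf files
    prefix = work / "finetuned"                     # checkpoint prefix for lstmtraining
    final_model = work / f"{lang}_finetuned.traineddata"

    commands = []
    # 1. Turn each PNG + .box pair into a .lstmf training file.
    for stem in image_stems:
        commands.append(["tesseract", f"{stem}.png", str(stem), "--psm", "6", "lstm.train"])
    # 2. Extract the LSTM model from the existing best traineddata.
    commands.append(["combine_tessdata", "-e", str(base), str(extracted)])
    # 3. Continue training from that model on the new data.
    commands.append([
        "lstmtraining",
        "--model_output", str(prefix),
        "--continue_from", str(extracted),
        "--traineddata", str(base),
        "--train_listfile", str(list_file),
        "--max_iterations", str(max_iterations),
    ])
    # 4. Convert the last checkpoint into a usable .traineddata file.
    commands.append([
        "lstmtraining",
        "--stop_training",
        "--continue_from", f"{prefix}_checkpoint",
        "--traineddata", str(base),
        "--model_output", str(final_model),
    ])
    return commands


def lstmf_list(image_stems):
    """Contents of the list file: one .lstmf path per line."""
    return "\n".join(f"{stem}.lstmf" for stem in image_stems) + "\n"


if __name__ == "__main__":
    for cmd in finetune_commands(["img/a", "img/b"], "tessdata_best/eng.traineddata", "out"):
        print(" ".join(cmd))
```

To actually run it, the list file would be written with `lstmf_list(...)` after step 1 and each command passed to `subprocess.run(cmd, check=True)`. The resulting file can then be copied into tessdata and selected with `-l eng_finetuned`, which is how tesseract is told to use that data when reading images.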

On Wednesday, August 7, 2019, 10:40:40 (UTC-4), Mamadou wrote:
>
> On Wednesday, August 7, 2019 at 4:10:44 PM UTC+2, Cristobal Jesus Muñoz
> Solano wrote:
>>
>> Hello, I have already tried mrz.traineddata. It is quite good, but it is
>> not accurate enough. How can I improve it? Is it possible to use box /
>> png files to improve its accuracy?
>>
> mrz.traineddata was generated using thousands of images. I doubt you'll
> be able to increase the accuracy just by adding more data.
>
> Most of the time the accuracy issues are related to poor image
> pre-processing.
>
> You can try https://www.doubango.org/webapps/mrz/ (which uses
> mrz.traineddata) with the failing images to see if it works. If it works,
> this means the issue is in the pre-processing.
>
> If you share some sample images it would be easier to help you.
>
>> On Friday, August 2, 2019, 12:07:34 (UTC-4), shree wrote:
>>>
>>> Have you tried
>>>
>>> - https://github.com/DoubangoTelecom/tesseractMRZ
>>>
>>> On Fri, Aug 2, 2019 at 9:26 PM Cristobal Jesus Muñoz Solano <
>>> cmun...@gmail.com> wrote:
>>>
>>>> Hello, I am trying to use tesseract. I have read all the
>>>> documentation and done many tests. Sorry if this is not the place to
>>>> ask this question, but I have been researching for several days, I
>>>> have many doubts, and I do not know what to do or where to look.
>>>> I'm frustrated.
>>>>
>>>> 1) If I want to train tesseract to improve its efficiency when reading
>>>> images with the OCR-B font, should I first fine-tune by adding the
>>>> OCR-B font? Or can I create a traineddata directly from the images/box
>>>> files and then combine it with the best model?
>>>>
>>>> 2) How do I add many images / box files to the best model?
>>>>
>>>> 3) Once you have a .traineddata ready and saved in tessdata, is that
>>>> enough to test it? When you run tesseract, will it use that data to
>>>> read the images?
>>>>
>>>> I already tried this script
>>>> https://github.com/Shreeshrii/tessdata_ocrb
>>>>
>>>> but I still don't understand how to add new training images to the best
>>>> model
>>>>
>>>> please help me, I don't want to kill myself so young
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to tesser...@googlegroups.com.
>>>> To view this discussion on the web visit
>>>> https://groups.google.com/d/msgid/tesseract-ocr/b63523ed-0e81-483b-a224-ada4c786fa3d%40googlegroups.com
>>>
>>> --
>>> ____________________________________________________________
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/833e0e09-07cb-4c9d-945a-95fe5a13f4a2%40googlegroups.com.