On Wednesday, August 7, 2019 at 4:10:44 PM UTC+2, Cristobal Jesus Muñoz Solano wrote:
>
> Hello, I have already tried mrz.traineddata. It is quite good, but it is not
> accurate enough. How can I improve it? Is it possible to use box / png
> files to improve its accuracy?
>

mrz.traineddata was generated using thousands of images. I doubt you'll be able to increase the accuracy just by adding more data.
Most of the time, accuracy issues are related to poor image pre-processing. You can try https://www.doubango.org/webapps/mrz/, which uses mrz.traineddata, with the failing images to see if it works. If it works, the issue is in the pre-processing. If you share some sample images, it would be easier to help you.

>
>
> On Friday, August 2, 2019, 12:07:34 (UTC-4), shree wrote:
>>
>> Have you tried
>>
>> - https://github.com/DoubangoTelecom/tesseractMRZ
>>
>>
>> On Fri, Aug 2, 2019 at 9:26 PM Cristobal Jesus Muñoz Solano <
>> cmun...@gmail.com> wrote:
>>
>>> Hello, I am trying to use tesseract. I have read all the
>>> documentation and done many tests. Sorry if this is not the place to
>>> ask this question, but I have been researching for several days, I
>>> have many doubts, and I do not know what to do or where to look.
>>> I'm frustrated.
>>>
>>> 1) If I want to train tesseract to improve its efficiency at reading
>>> images in the OCR-B font, should I first fine-tune by adding the OCR-B
>>> font? Or can I create a traineddata directly from the images/box files
>>> and then combine it with the best model?
>>>
>>> 2) How do I add many images / box files to the best model?
>>>
>>> 3) Once a .traineddata file is ready and saved in tessdata, is that
>>> enough for tesseract to use that data to read the images when I run it?
>>>
>>> I already tried this script:
>>> https://github.com/Shreeshrii/tessdata_ocrb
>>>
>>> but I still don't understand how to add new training images to the best
>>> model.
>>>
>>> Please help me, I don't want to kill myself so young.
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesser...@googlegroups.com.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/b63523ed-0e81-483b-a224-ada4c786fa3d%40googlegroups.com
>>> .
>>>
>>
>>
>> --
>>
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>
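
To make the pre-processing point in the reply above more concrete, here is a minimal sketch of one common step, binarization with Otsu's threshold, in pure Python. The thread does not say which pre-processing the web demo or mrz.traineddata expects, so this is only an illustrative assumption. The function names `otsu_threshold` and `binarize` are hypothetical, and a real pipeline would more likely use an image library and add steps such as deskewing and cropping to the MRZ lines.

```python
def otsu_threshold(pixels):
    """Return an Otsu threshold (0-255) for an iterable of 8-bit gray values."""
    # Build a 256-bin histogram of gray levels.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0      # number of pixels at or below the candidate threshold
    sum_bg = 0    # sum of gray values at or below the candidate threshold
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue          # no dark pixels yet
        w_fg = total - w_bg
        if w_fg == 0:
            break             # no bright pixels left
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Between-class variance (up to a constant factor); keep the first maximum.
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image):
    """Map a grayscale image (list of rows of 0-255 ints) to pure black/white."""
    flat = [p for row in image for p in row]
    t = otsu_threshold(flat)
    # Pixels at or below the threshold become black (text), the rest white.
    return [[0 if p <= t else 255 for p in row] for row in image]


if __name__ == "__main__":
    # Tiny synthetic "image": dark strokes (30-50) on a light background (200-220).
    sample = [[30, 200, 210],
              [220, 40, 50]]
    for row in binarize(sample):
        print(row)
```

If a binarized image reads correctly where the original did not, that points at pre-processing rather than at the trained model, which is the same check the reply suggests doing with the web demo.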