On Wednesday, August 7, 2019 at 4:10:44 PM UTC+2, Cristobal Jesus Muñoz 
Solano wrote:
>
> Hello, I have already tried mrz.traineddata. It is quite good, but it is 
> not accurate enough. How can I improve it? Is it possible to use box/png 
> files to improve its accuracy?
>
mrz.traineddata was generated using thousands of images. I doubt you'll be 
able to increase the accuracy just by adding more data.

Most of the time the accuracy issues are related to poor image 
pre-processing.
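
As a rough illustration of one common pre-processing step, here is a minimal 
sketch of Otsu binarization in plain Python. It assumes the image has already 
been loaded as rows of grayscale values (0..255); the function names are 
hypothetical, and this is not necessarily what the web app mentioned in this 
thread does internally.

```python
def otsu_threshold(pixels):
    """Return an Otsu threshold for a grayscale image (rows of 0..255 ints)."""
    hist = [0] * 256
    for row in pixels:
        for value in row:
            hist[value] += 1

    total = sum(hist)
    if total == 0:
        raise ValueError("empty image")
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0    # number of pixels with value <= t
    sum_bg = 0  # sum of those pixel values
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue
        w_fg = total - w_bg
        if w_fg == 0:
            break
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Between-class variance (up to a constant factor).
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(pixels, threshold):
    """Map pixels above the threshold to white (255), the rest to black (0)."""
    return [[255 if v > threshold else 0 for v in row] for row in pixels]


# Tiny synthetic example: dark "text" on the left, light "paper" on the right.
image = [[30, 40, 200, 210],
         [35, 45, 205, 215]]
t = otsu_threshold(image)
binary = binarize(image, t)
```

On real photos, uneven lighting, skew and low resolution usually matter as 
well, so a global threshold like this is only a starting point.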

You can try https://www.doubango.org/webapps/mrz/, which uses 
mrz.traineddata, with the failing images to see if it works. If it works 
there, the issue is in your pre-processing.
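
To compare against your own setup, a minimal sketch of running the tesseract 
command line with the same model could look like this. It assumes the 
tesseract executable is on your PATH and that mrz.traineddata has been copied 
into the directory passed as tessdata_dir; the helper names, the example 
paths and the choice of --psm 6 (single uniform block of text) are my 
assumptions, not something taken from the web app.

```python
import subprocess


def build_tesseract_command(image_path, tessdata_dir, lang="mrz", psm=6):
    """Build the tesseract CLI call; "-l mrz" makes it load mrz.traineddata."""
    return [
        "tesseract", image_path, "stdout",
        "--tessdata-dir", tessdata_dir,
        "-l", lang,
        "--psm", str(psm),
    ]


def run_ocr(image_path, tessdata_dir):
    """Run tesseract and return the recognized text (requires tesseract installed)."""
    result = subprocess.run(
        build_tesseract_command(image_path, tessdata_dir),
        capture_output=True, text=True, check=True,
    )
    return result.stdout


# Hypothetical paths, for illustration only.
command = build_tesseract_command("passport.png", "/opt/tessdata")
```

If the web app reads an image correctly and this call does not, the 
difference is most likely in how the image was prepared before OCR.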

If you share some sample images it would be easier to help you.

>
>
> On Friday, August 2, 2019 at 12:07:34 (UTC-4), shree wrote:
>>
>> Have you tried 
>>
>>    - https://github.com/DoubangoTelecom/tesseractMRZ
>>
>>
>> On Fri, Aug 2, 2019 at 9:26 PM Cristobal Jesus Muñoz Solano <
>> cmun...@gmail.com> wrote:
>>
>>> Hello, I am trying to use tesseract. I have read all the 
>>> documentation and done many tests. Sorry if this is not the place to 
>>> ask this question, but I have been researching for several days, I still 
>>> have many doubts, and I do not know what to do or where to look. 
>>> I'm frustrated.
>>>
>>> 1) If I want to train tesseract to improve its accuracy when reading 
>>> images with the OCR-B font, should I first fine-tune by adding the OCR-B 
>>> font? Or can I create a traineddata file directly from the images/box 
>>> files and then combine it with the best model?
>>>
>>> 2) How do I add many images/box files to the best model?
>>>
>>> 3) Once a .traineddata file is ready and saved in tessdata, is that 
>>> enough for tesseract to use that data when I run it to read the images?
>>>
>>> I already tried this script:
>>> https://github.com/Shreeshrii/tessdata_ocrb
>>>
>>> but I still don't understand how to add new training images to the best 
>>> model.
>>>
>>> please help me, I don't want to kill myself so young
>>>
>>> -- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to tesser...@googlegroups.com.
>>> To view this discussion on the web visit 
>>> https://groups.google.com/d/msgid/tesseract-ocr/b63523ed-0e81-483b-a224-ada4c786fa3d%40googlegroups.com
>>>
>>
>>
>> -- 
>>
>> ____________________________________________________________
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>
>
