Hi, I'm trying to use Tesseract to recognise some characters from medical images, and I used JtessBoxEditor to edit box file and adapt it to errors in this image. But when I run tesseract after modify that box whit correct chars, it keeps making an output with previous box values.
I don't know if I am training it correctly, I get this messages: ** MF Training ** [C:\Users\FlcUser\Downloads\jTessBoxEditor-1.7.3\jTessBoxEditor\tesseract-ocr/mftraining, -F, eng.font_properties, -U, unicharset, -O, eng.unicharset, testImage.tr] Done! Read shape table shapetable of 6 shapes Reading testImage.tr ... Bad properties for index 3, char 1: 0,255 0,255 0,0 0,0 0,0 Bad properties for index 4, char 0: 0,255 0,255 0,0 0,0 0,0 Bad properties for index 5, char /: 0,255 0,255 0,0 0,0 0,0 Bad properties for index 6, char F: 0,255 0,255 0,0 0,0 0,0 Bad properties for index 7, char L: 0,255 0,255 0,0 0,0 0,0 Bad properties for index 8, char _: 0,255 0,255 0,0 0,0 0,0 Warning: no protos/configs for Joined in CreateIntTemplates() Warning: no protos/configs for |Broken|0|1 in CreateIntTemplates() What I'm doing wrong? When you train it specifically for one image shouldn't it be more accurate for this image? Thank You!!! -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/5fb11cca-8f16-470c-b718-d18582dcda13%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.