I'm trying to finetune Tesseract to recognize digits only but I'm not getting good results so far. I continued the training from Arabic language "ara" since the digits I'm trying to recognize are Arabic numbers. The training will stop early at 0.01 error rate but the results on testing data is really bad.
I'm using my box/tif files and my training text with Tesstrain.h Any recommendation on what should I do to get better results? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/aba88134-bcbe-405f-b1bb-520275cc4227n%40googlegroups.com.