- I have been training a large amount of data: about 390,000 lines of text for each font: for 15 fonts. I run around 1.2 million iterations. The progress was encouraging to some degree. But, at some point, the BCER started to get down fast; and reached at 0.01 error. The training stopped at that point. I was excited thinking that my model is getting as accurate as it gets. But, turns out, it is no where as good as the default (RAy's) best model.
The am doing fine tuning by removing the top layer of the network. This was the command I used to train: *make training MODEL_NAME=amh START_MODEL=amh APPEND_INDEX=5 NET_SPEC='[Lfx256 O1c105]' TESSDATA=../tesseract/tessdata MAX_ITERATIONS=500000 DEBUG_INTERVAL=-1 training >> data/amh.log &* What do you think is going on here? Do you think I am over-fitting the model? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/a71c6a84-35fa-4b19-81ad-4e24bf36d8a4n%40googlegroups.com.