- I have been training a large amount of data: about 390,000 lines of text 
for each font: for 15 fonts. I run around 1.2 million iterations. The 
progress was encouraging to some degree. But, at some point, the BCER 
started to get down fast; and reached at 0.01 error. The training stopped 
at that point. 
I was excited thinking that my model is getting as accurate as it gets. 
But, turns out, it is no where as good as the default (RAy's) best model. 

The am doing fine tuning by removing the top layer of the network. 
This was the command I used to train: *make training MODEL_NAME=amh 
START_MODEL=amh APPEND_INDEX=5 NET_SPEC='[Lfx256 O1c105]' 
TESSDATA=../tesseract/tessdata MAX_ITERATIONS=500000 DEBUG_INTERVAL=-1 
training >> data/amh.log &*

What do you think is going on here?
Do you think I am over-fitting the model?

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/a71c6a84-35fa-4b19-81ad-4e24bf36d8a4n%40googlegroups.com.

Reply via email to