[tesseract-ocr] Using lang files from 3.04 with T4 legacy mode

estel4ever Mon, 08 Apr 2019 22:56:50 -0700

Hi everyone!

Have been using T3.04 for a while and have created several language files 
to improve ocr quality for specific pdfs.
After moving to T4 overall quality increased with default eng language 
file, but there is still one pdf type where I get a lot of digits 
incorrectly (5 is treated is 6, 9 as 8, etc).


I wanted to run those pdfs through legacy mode (oem 0), but T4 cannot load 
the trainneddata file from T3.04.
Is this possible?
Haven't found any relevant info in docs or here :(


Regards, Alex

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/5c482953-5794-4cc6-8861-80e9467d048d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Using lang files from 3.04 with T4 legacy mode

Reply via email to