[tesseract-ocr] The pictures captured by the camera did not identify well after preprocessing

vis li Wed, 15 Sep 2021 22:59:32 -0700


Tesseract version: 4.1.1
Platform: Windows 10
<https://user-images.githubusercontent.com/51877381/133545017-12e2b715-be45-4198-8035-9838c5375ea9.png> [image: testa.png]
<https://user-images.githubusercontent.com/51877381/133545026-66cdd822-6885-4561-aa8c-d13496573a62.png> [image: testb.png]
Output of Page.getText():


ACBEDFHGIKJLNHOP
RQSUTV¥WYaZbdcef

1ppp000012121010
&*(O+-,.:; O=%/

As shown above, the result contains several errors.
I know that my images have some defects, but how can I improve the recognition?
I have already binarized the images and tried raising the DPI to 300.
Because the pictures were captured by a camera, I am worried about whether
they can meet the standard expected of web pictures.
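
For reference, a minimal sketch of one common binarization approach, Otsu's global threshold, written in plain Python. This is only an illustration of the kind of preprocessing mentioned above and not necessarily the method applied to testa.png and testb.png. The function names `otsu_threshold` and `binarize` are hypothetical, and the image is assumed to be an 8-bit grayscale picture stored as a list of rows.

```python
def otsu_threshold(pixels):
    """Return Otsu's threshold (0-255) for an iterable of 8-bit gray values."""
    # Build a 256-bin histogram of the gray levels.
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0      # number of pixels at or below the candidate threshold
    sum_bg = 0    # sum of gray values at or below the candidate threshold
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        if w_bg == 0:
            continue          # no background pixels yet
        w_fg = total - w_bg
        if w_fg == 0:
            break             # no foreground pixels left
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Between-class variance (up to a constant factor).
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(image):
    """Map each pixel to 0 or 255 using Otsu's threshold for the whole image."""
    t = otsu_threshold(p for row in image for p in row)
    return [[255 if p > t else 0 for p in row] for row in image]


# Tiny synthetic example: a dark row and a bright row.
sample = [[10, 10, 10], [200, 200, 200]]
result = binarize(sample)
```

A single global threshold like this assumes fairly even lighting. Camera pictures often have shadows or uneven illumination, in which case a locally adaptive threshold may behave differently.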

I am using LSTM mode, and my recognition language data file was trained with
LSTM on the Microsoft YaHei standard font.


