maybe you can try '-c tessedit_char_whitelist="我愛你"', something like this.

un C <enya.ohfee.0...@gmail.com> 于2020年7月29日周三 下午5:27写道:

> I am using tesseract v5.0.0-alpha.20200328.
>
> I tried ' -c tessedit_char_whitelist=0123456789,' it does work.
> But for Chinese characters, neither '-c tessedit_char_whitelist=我愛你' nor
> the unicode '-c tessedit_char_whitelist=\u6211\u611b\u4f60' work.
>
> Can anyone give me a hint? Thanks a lot.
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/e1fe3ec5-3df5-42c3-8ddd-faac75e22f77o%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/e1fe3ec5-3df5-42c3-8ddd-faac75e22f77o%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAPu7Z%2Bi77NHS-y2evCn%2B5H-cncL8FqM6oirLsGKYUBcWn29YCA%40mail.gmail.com.

Reply via email to