Maybe you could try something like '-c tessedit_char_whitelist="我愛你"'.
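
If shell quoting turns out to be part of the problem, one way to rule it out is to pass the arguments as a list from Python, so the characters reach tesseract untouched. This is only a rough sketch: the file name input.png, the language chi_tra, and the two helper function names are my own assumptions, and I have not checked whether it changes the result on 5.0.0-alpha.

```python
import subprocess


def build_tesseract_command(image_path, whitelist, lang="chi_tra"):
    """Build the argv list for a tesseract run limited to `whitelist`.

    `lang` defaults to chi_tra purely as an assumption; use whichever
    traineddata is installed for the characters you expect.
    """
    return [
        "tesseract", image_path, "stdout",
        "-l", lang,
        # One argv entry, so no shell quoting or escaping is involved.
        "-c", f"tessedit_char_whitelist={whitelist}",
    ]


def run_ocr(image_path, whitelist, lang="chi_tra"):
    """Run tesseract (must be on PATH) and return the recognized text."""
    cmd = build_tesseract_command(image_path, whitelist, lang)
    result = subprocess.run(
        cmd, capture_output=True, text=True, encoding="utf-8", check=True
    )
    return result.stdout


if __name__ == "__main__":
    # Only prints the command; call run_ocr("input.png", "我愛你") to execute it.
    print(build_tesseract_command("input.png", "我愛你"))
```

As far as I know tesseract does not decode \uXXXX escapes in config values, so the literal characters are passed here instead of the escaped form.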

un C <enya.ohfee.0...@gmail.com> wrote on Wed, Jul 29, 2020 at 5:27 PM:

> I am using tesseract v5.0.0-alpha.20200328.
>
> I tried ' -c tessedit_char_whitelist=0123456789,' it does work.
> But for Chinese characters, neither '-c tessedit_char_whitelist=我愛你' nor
> the unicode '-c tessedit_char_whitelist=\u6211\u611b\u4f60' work.
>
> Can anyone give me a hint? Thanks a lot.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAPu7Z%2Bi77NHS-y2evCn%2B5H-cncL8FqM6oirLsGKYUBcWn29YCA%40mail.gmail.com.