I am working on a volunteer project to digitize the Sutra and all related materials, most of them in Tibetan. It will save a lot of time if I can use some OCR technology in this process. However, there are hardly any software available for Tibetan. Therefore, I wonder how I can get help to use Tesseract for Tibetan. (I am new on both OCR and Tesseract and the only programming language I know is R.) I have no idea how to get started, training Tesseract for a new language? Tibetan? And what if the image contains both Chinese and Tibetan? Please give me some hints. Thanks a lot.
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/0959cecf-8d21-4a9c-b6bf-b53227439e6a%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

