Please see https://github.com/tesseract-ocr/langdata/issues/126
Replying there. On Monday, April 23, 2018 at 2:16:06 AM UTC+5:30, Christopher Imantaka Halim wrote: > > Hi, > > I want to develop an OCR for Javanese Script / Aksara. > https://en.wikipedia.org/wiki/Javanese_script > > Plan on using Tesseract version 4.0 > I've read the wiki but somehow got confused. > > What do I need to prepare, to start the bare minimum training process? > (for Tesseract 4.0) > In some other thread someone said that training using image files are not > supported yet. > Also found out that box file/tiff pairs are not supported also. > (I did try making one box file, using this online tool: > https://pp19dd.com/tesseract-ocr-chopper/) > > Do we have an example of the training "inputs" somewhere on the github > projects? > > Sorry if this is a stupid question, I'm a newbie. :) > > Thanks before > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/352c634a-33d7-43cf-b2eb-58b9385b93a7%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.