+1 for this question. The training documentation for Tesseract 4.0 by now only covers training with font files (synthetic materials). What is missing is information on training with real data (i.e. manually aligned ground truth). Any hints on that matter are greatly appreciated.
Cheers, Kay On Wednesday, January 18, 2017 at 12:31:54 AM UTC+1, chen...@huawei.com wrote: > > I have a bunch of images, containing English words. > I would like to generate training data by these images, and do the > training. > How should I do? > > Thanks a lot. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/7bffab95-3e6b-4165-929e-a152f1799703%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.