I added only one character, about 30 times, to ben.training_text (and only at the end of the original training text), so I did not modify the original ben.training_text in any significant way. Why am I still getting "normalization failed" for many words that are already in the original training_text?
I then tried to create training data without any extra characters, using only the original training text, and I still got "normalization failed" and "Stripped 1 unrenderable words". Why is this so?

On Wed, May 29, 2019 at 3:50 PM Shree Devi Kumar <shreesh...@gmail.com> wrote:
> Check that the training text you used is normalized correctly, also check
> the Bengali normalization/validation rules
> https://github.com/tesseract-ocr/tesseract/issues/1038
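
One way to act on the suggestion above is to scan the training text for words that are worth a closer look. The sketch below is only a rough first pass: it assumes that Unicode NFC form and the presence of zero-width joiner characters are useful signals, and it does not reproduce Tesseract's own Bengali normalization/validation rules, which may reject words that pass this check. The function name `find_suspect_words` is hypothetical and not part of any Tesseract tool.

```python
import unicodedata

# Zero-width non-joiner (U+200C) and joiner (U+200D). Flagged for manual
# review only; this is an assumption, not Tesseract's actual rule set.
ZERO_WIDTH = {"\u200c", "\u200d"}


def find_suspect_words(text):
    """Return (line_no, word, reason) tuples for words that are not in
    Unicode NFC form or that contain zero-width joiner characters."""
    suspects = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for word in line.split():
            if unicodedata.normalize("NFC", word) != word:
                suspects.append((line_no, word, "not NFC"))
            elif any(ch in ZERO_WIDTH for ch in word):
                suspects.append((line_no, word, "zero-width char"))
    return suspects
```

To try it on the file discussed here, read ben.training_text as UTF-8 and pass its contents to `find_suspect_words`; any words it reports can then be compared against the words named in the "normalization failed" messages.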