I found myself. Please see language-specific.sh VERTICAL_FONTS(= \ "TAKAoExGothic" \ #for jpn "TAKAoExMincho" \ #for jpn "WHAT_EVER_FONT_YOU_WANT_TO_ADD" )
when you execute tesstrain.sh \ --font_dir /usr/share/font --lang jpn ... --fontlist "WHAT_EVER_FONT_YOU_WANT_TO_ADD" You will see the box file and tiff file where characters are vertically aligned. Thanks! On Sun, Oct 7, 2018 at 12:56 PM Seokbong Choi <zodiac3...@gmail.com> wrote: > Hello, > > I am a Japanese comic book fan. Recently, I come to learn about tesseract, > which is awesome. > There are many challenges around Japanese - it has millions characters, so > that millions of iteration are required to train. > > Another challenge is vertical text. Most of comic books use vertical > alignment for the text. > I am trying to train tesseract based upon JPN_VERT (I already successfully > trained JPN). > However, I am not able to find a way to generate "box" file, which is > aligned vertically to train JPN_VERT further. > Any idea? > > Thanks in advance. > > Greg. > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To post to this group, send email to tesseract-ocr@googlegroups.com. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/c82e323c-17b3-4b1e-a8a9-074fadb88528%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/c82e323c-17b3-4b1e-a8a9-074fadb88528%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CA%2BVWkA4fF9kYUXxf7Rv2TS_uxxS90S72df-qFU_LXZAZ%2BxtTaA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.