I am planning to generate the training data in WordStr format. Consider the following example:

WordStr 114 4640 1907 4692 0 #Information Groups for public OPTIONAL, jaundice Proterozoic Have LOCATION
	 1908 4640 1912 4692 0

From the above data, [114, 4640, 1907, 4692] is the bounding box for the text "Information Groups for public OPTIONAL, jaundice Proterozoic Have LOCATION".

But I am confused about the second line, "\t 1908 4640 1912 4692 0". Why do we need it, and what does it represent? What information does the bounding box [1908, 4640, 1912, 4692] carry?

Best,
Gaurav.