After I run unicharset_extractor rashi.bold.exp1.box rashi.regular.exp1.box
I get some lines in the unicharset file that are not explained anywhere. For example, Joined 0 0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # Joined [4a 6f 69 6e 65 64 ] |Broken|0|1 0 0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # Broken What should I do with these? Also, the remaining lines don't match what the wiki says for training tesseract 3. E.g., 0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # מ [5de ] and . 0 0,255,0,255,0,32767,0,32767,0,32767 NULL 0 0 0 # . [2e ] Any help would be appreciated. Thanks, -seth -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.

