On 20 August 2010 12:53, colbec <col...@start.ca> wrote: > Using tesseract 3.00 on Opensuse 11.2. From CLI as in > tesseract file.tif file > > In an image that contains a line of '=' signs the recognition is much > worse than if these lines are removed, eg: > > line 1 and stuff > ======================= > line 3 and stuff > > line 1 will be recognized, but the second and third lines will be > either missing or line 2 missing and line 3 garbled. > If the file contains lines 1 and 3 only, the recognition is almost > perfect. > > Since the "=" character appears to be in the trained charset, what > kind of error does this represent for tesseract?
At a guess - without providing a sample image, that's the best you can expect - I would say that the line of equals is being treated as noise. -- <Leftmost> jimregan, that's because deep inside you, you are evil. <Leftmost> Also not-so-deep inside you. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-...@googlegroups.com. To unsubscribe from this group, send email to tesseract-ocr+unsubscr...@googlegroups.com. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.