I'd like to produce high-quality OCR of books that contain text interspersed with music. Is it possible to train Tesseract to ignore musical notation instead of turning it into junk OCR? How would one go about doing this?
-- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3988b3d7-d757-4c10-9a66-f7aa34a65b6f%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.