On Monday, August 5, 2024 at 8:15:27 PM UTC-4 Danny wrote:
So, I'm thinking the issue is with the preprocessing, segmentation, and glyph identification more than the model itself. I agree with that and I suspect you can do a better job of line segmentation than Tesseract can since you have more information available to you about font size, context, etc. Tom -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/3ae54d49-1fa7-4ea9-818b-8ed0435876ddn%40googlegroups.com.