I do not know. The upscaling trick has been around since version 3.x. The downscaling trick works from version 4.x. Just looking at Willus Dotkom's chart [1], I would guess there is some design decision behind it... But without an explanation from the original/Google programmers, we can only guess or find a bug ;-)
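To make the rescaling idea concrete, here is a minimal sketch of how one might compute a new image size from the measured height of the capital letters. The function name `rescale_size` and the default target of 30 pixels are my assumptions for illustration only; they are not taken from Tesseract itself, and the best target should be checked against the chart and your own images.

```python
def rescale_size(width, height, cap_height_px, target_cap_height_px=30):
    """Return (new_width, new_height, factor) for a uniform rescale.

    cap_height_px is the measured height of capital letters in the
    source image. target_cap_height_px is an assumed value, not a
    Tesseract constant; tune it for your own data.
    """
    if cap_height_px <= 0 or target_cap_height_px <= 0:
        raise ValueError("letter heights must be positive")
    # factor > 1 means upscaling, factor < 1 means downscaling.
    factor = target_cap_height_px / cap_height_px
    new_width = max(1, round(width * factor))
    new_height = max(1, round(height * factor))
    return new_width, new_height, factor


if __name__ == "__main__":
    # Letters are too large: the image gets downscaled.
    print(rescale_size(1000, 400, 60))
    # Letters are too small: the image gets upscaled.
    print(rescale_size(200, 80, 15))
```

The resulting size could then be passed to an image library (for example Pillow's `Image.resize((new_width, new_height))`) before running tesseract on the result.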
[1] https://groups.google.com/group/tesseract-ocr/attach/51b840d4782db/tess4_error_rate.png?part=0.2&view=1

Zdenko

On Sun, 27 Feb 2022 at 11:27, Merlijn B.W. Wajer <merl...@archive.org> wrote:

> Hi,
>
> On 27/02/2022 08:55, Zdenko Podobny wrote:
> > tesseract fix_size.png -
> >
> > 0326
> > 0939
> > 1552
> > 2206
> >
> > See doc for explaining:
> > https://github.com/tesseract-ocr/tessdoc/blob/main/ImproveQuality.md#rescaling
>
> Thanks for the suggestion, I'm also running into this problem in some
> cases. Is it possible that this is also some kind of segmentation bug? I
> wonder what Tesseract finds here in this clear image that causes it to
> produce an extra character.
>
> Regards,
> Merlijn