Using different interpolation methods of magnification gave me different results, but I was not able to get the "/" character out of the string. Magnifying the image by 200% using a Box, Triangle, or Catmull-Rom interpolation algorithm gave me "NIA". Using Mitchell, I got "NVA". The Cubic B-Spline was too fuzzy for Tesseract to recognize any of the characters.
Does anyone have any further ideas? I wish there was a way to tell Tesseract to ignore font embellishments, such as italics or underlining. On Tuesday, October 8, 2024 at 10:16:17 AM UTC-5 pankaj....@gmail.com wrote: > Hi > Did you try this trick ?? > > On Tue, 8 Oct 2024, 20:42 Art Rhyno, <artr...@uwindsor.ca> wrote: > >> You could try resizing the image, with imagemagick, something like: >> >> >> >> convert test.bmp -resize 200% test.png >> >> >> >> That seems to be enough to separate out the “N” and the “/”. >> >> >> >> art >> >> >> >> *From:* tesser...@googlegroups.com <tesser...@googlegroups.com> *On >> Behalf Of *Will Fetherolf >> *Sent:* Monday, October 7, 2024 9:33 PM >> *To:* tesseract-ocr <tesser...@googlegroups.com> >> *Subject:* [tesseract-ocr] Help with recognition please >> >> >> >> You don't often get email from will.fe...@gmail.com. Learn why this is >> important <https://aka.ms/LearnAboutSenderIdentification> >> >> The application I'm attempting to OCR is using what I think is Arial for >> the font, but every time I run the attached image through Tesseract 5.4.0 >> on Windows I get "NVA" or "NIA" depending on which PSM I use. If I use 7, >> I always get back "NIA". I have tried running training on a variety of >> captured data from my application with no success. >> >> >> >> Help me, Obi-Wan Kenobi, you're my only hope! >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-oc...@googlegroups.com. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/01ab548e-e45e-48b7-824d-73debed1adb1n%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/01ab548e-e45e-48b7-824d-73debed1adb1n%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-oc...@googlegroups.com. >> > To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/YQBPR0101MB85429847B45CE3732F0ECE5FDC7E2%40YQBPR0101MB8542.CANPRD01.PROD.OUTLOOK.COM >> >> <https://groups.google.com/d/msgid/tesseract-ocr/YQBPR0101MB85429847B45CE3732F0ECE5FDC7E2%40YQBPR0101MB8542.CANPRD01.PROD.OUTLOOK.COM?utm_medium=email&utm_source=footer> >> . >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/dadea12d-5d9f-4e9d-a6e0-72582e239eb8n%40googlegroups.com.