1. Try to use the tesseract executable if there are any problems when using API/tesseract wrappers 2. Did you try image processing (as suggested by tesseract documentation? 3. Did you try custom image segmentation? Your image seems like a table and the tesseract layout analyze has a problem with tables.
Zdenko pi 28. 4. 2023 o 19:18 Are <arejan...@gmail.com> napísal(a): > Hello, > > I have this simple Tesseract code which takes the attached image and > prints the result to the console. > I cropped the image to only include the neccessary information (the full > document has sensitive information). Either way, using the cropped image or > the full one, it successfully reads most of the text, except for the text > with the consolas font. > > The output I get from the attached image is: ">BUWVveAmæUw >» >> U U" > Although, when I use the full image, it is able to read the bot > > I'm using the nor.traineddata, but the result is very similar with > eng.traineddata also. > > > > Here's my code: > > using System; > using Tesseract; > > namespace ConsoleApp1 > { > class Program > { > static void Main(string[] args) > { > using (var engine = new TesseractEngine(@"./tessdata", "nor", > EngineMode.Default)) > { > using (var img = > Pix.LoadFromFile(@"./images/unnamed2.jpg")) > { > using (var page = engine.Process(img)) > { > var text = page.GetText(); > Console.WriteLine(text); > } > } > } > } > } > } > > > > *Here's the image:* > > [image: unnamed2.jpg] > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/329a8635-723f-4664-957a-0ef952094912n%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/329a8635-723f-4664-957a-0ef952094912n%40googlegroups.com?utm_medium=email&utm_source=footer> > . > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8wTWduXBerDWTTnCwRB28yoSKX0fibEQqa%2BMvKHojxtfA%40mail.gmail.com.