date:20221211

[tesseract-ocr] Re: A Simple grayscale image cannot be OCR'd

2022-12-11 Thread Alessandro Weber

Hi Andrew, your thought is exactly what I would do: replace RGB (201, 201, 201) with RGB (255, 255, 255) and then process the image with text on white ground. : ) Ale Il giorno domenica 11 dicembre 2022 alle 08:48:40 UTC+1 Andrew ha scritto: > I have processed imaged that seems pretty simple:

Re: [tesseract-ocr] A Simple grayscale image cannot be OCR'd

2022-12-11 Thread Zdenko Podobny

run this to understand what it problem: > tesseract 8fXlqZY.png 8fXlqZY --psm 7 get.images Then check the binarized version of your input that is used for OCR: 8fXlqZY.processed.tif There are 2 simple ways to solve the problem: 1. using only text ares for OCR (e.g. cropping image to text wit