[tesseract-ocr] Textlines detection algorithm used in tesseract

2022-08-08 Thread 张晓艺
Hi all, I'm looking a non-deeplearning-based textlines detection/segmentation algorithm, since my problem is relatively easy (without complexed background). I wrote several scripts with a combination of OpenCV ops like dilate, erode and findContours etc.. But I found it's very sensitive to hyp

Re: [tesseract-ocr] Extraction error for PNG files - pixCreateNoInit: pix_malloc fail for data

2022-08-08 Thread Zdenko Podobny
try the latest tesseract and letonica version - there were some improvement for big size images. On Thu, 28 Jul 2022, 10:48 Gaurav Verma, wrote: > Hi, > I am trying extracting text from some PNG images on windows Server 2019 > Standard using Tesseract OCR 5.0.1 but getting some image validation