Re: [tesseract-ocr] Removing colors

2021-01-08 Thread Deepak Sharma
are there any equivalent function in OpenCV which can do similar executions like what you did with leptonica? On Friday, January 8, 2021 at 12:20:29 AM UTC+5:30 zdenop wrote: > Unfortunately I am not aware of (maintained) python leptonica support (any > volunteers?), but you can directly use le

Re: [tesseract-ocr] Removing colors

2021-01-07 Thread Zdenko Podobny
Unfortunately I am not aware of (maintained) python leptonica support (any volunteers?), but you can directly use leptonica&tesseract via cffi in python. See some examples : https://sk-spell.sk.cx/building-minimalistic-tesseract https://github.com/zdenop/SimpleTesseractPythonWrapper/blob/master/Sim

Re: [tesseract-ocr] Removing colors

2021-01-07 Thread Deepak Sharma
can you suggest me with an alternate for leptonica for "python & windows" On Thursday, January 7, 2021 at 1:42:28 AM UTC+5:30 zdenop wrote: > try to play with the leptonica pixAutoPhotoinvert function[1]. > quick test with following C code snippets provided attached result: > > pix = leptonica.pi

Re: [tesseract-ocr] Removing colors

2021-01-06 Thread Zdenko Podobny
try to play with the leptonica pixAutoPhotoinvert function[1]. quick test with following C code snippets provided attached result: pix = leptonica.pixRead("des_resume3.png"); pix1 = leptonica.pixThresholdToBinary(pix, 170); autoinverted = pixAutoPhotoinvert(pix1, thresh, NULL, NULL); pixWrite("aut

Re: [tesseract-ocr] Removing colors

2021-01-06 Thread Balasundaram Chinnaiyan
convert the image to grayscale and remove the gray . or use HSV colour code to remove it. Regards, Bala On Wed, 6 Jan 2021 at 22:13, Deepak Sharma wrote: > I am trying to preprocess resumes for building an OCR model. Please refer > to the reference image attached in this message. > As you can