Re: [tesseract-ocr] tesseract 4 on Debian Bullseye

2022-01-24 Thread Rich M
Looking at the .PDF to .tiff conversion, it might be an issue with convert, provided by imagemagick. Using a different CLI pdf to image conversion, tesseract seems to be working better. On Sun, Jan 23, 2022 at 8:44 PM Rich M wrote: > Please provide details for reproducing problem: input image, o

Re: [tesseract-ocr] tesseract 4 on Debian Bullseye

2022-01-23 Thread Zdenko Podobny
Please send the tesseract relevant file - tiff ;-) . First think you always need to check the tesseract input. Input of your script (pdf) is not important in this stage. Zdenko po 24. 1. 2022 o 4:44 Rich M napísal(a): > Please provide details for reproducing problem: input image, output pdf

Re: [tesseract-ocr] tesseract 4 on Debian Bullseye

2022-01-21 Thread Rich M
Sure. I'll need to find a test file that doesn't contain private information. Before seeing your response now, I ran my script on a file that I had converted to a searchable PDF last year and the output file was very poor. Out of curiosity, I changed the converted image from .tiff to .png and t

Re: [tesseract-ocr] tesseract 4 on Debian Bullseye

2022-01-19 Thread Zdenko Podobny
Please provide details for reproducing problem: input image, output pdf, tesseract details (tesseract -v) Zdenko št 20. 1. 2022 o 5:03 Rich M napísal(a): > Hi, > > I'm fairly new to tesseract and had a written a bash script in Debian > Buster(previous release) using tesseract 3 which worked ve

[tesseract-ocr] tesseract 4 on Debian Bullseye

2022-01-19 Thread Rich M
Hi, I'm fairly new to tesseract and had a written a bash script in Debian Buster(previous release) using tesseract 3 which worked very well. I've since upgraded my OS to the next stable release, Bullseye which also upgraded tesseract to V4. After the upgrade, tesseract isn't "working" any long