RE: [tesseract-ocr] Re: Article scanning: hocr output wrong after font training?

2024-01-08 Thread Art Rhyno
There is a related thread on stack overflow that might be helpful for your processing [1]. The thread is about italics and bolding, but font detection seems a tougher challenge. This repository [2] has links to Adobe work in the area and has an interesting implementation. You would still probabl

[tesseract-ocr] Errors With Downloading Tesseract v4.1.1

2024-01-08 Thread Evaan Ahmed
Hey y'all! On my local machine (a Mac), I'm trying to download the version of Tesseract that is available on Google Colab. This is version 4.1.1. I downloaded the files from https://github.com/tesseract-ocr/tesseract/releases/tag/4.1.1 and tried to run the following commands: cd tesseract

Re: [tesseract-ocr] Errors With Downloading Tesseract v4.1.1

2024-01-08 Thread Zdenko Podobny
Please provide full log of whole process (starting from autogen.sh) Zdenko ut 9. 1. 2024 o 6:50 Evaan Ahmed napĂ­sal(a): > Hey y'all! > > On my local machine (a Mac), I'm trying to download the version of > Tesseract that is available on Google Colab. This is version 4.1.1. I > downloaded the f