[tesseract-ocr] OpenCL with tesseract

2022-06-28 Thread Ahmad Anis
I am installing OpenCL. I am not sure if it is installed or not. Here is the output of my clinfo. $ clinfo clinfo Number of platforms 1 Platform Name Intel(R) OpenCL HD Graphics Platform Vendor Intel(R) Corporation Platform Version OpenCL 3.0 Platform Profile FULL_PROFILE Platform Extension

[tesseract-ocr] Re: OpenCL with tesseract

2022-06-28 Thread Ahmad Anis
Installing tesseract version 5 gave detected more things but not openCL. ``` tesseract --version tesseract 5.1.0-32-gf36c0 leptonica-1.79.0 libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 2.0.3) : libpng 1.6.37 : libtiff 4.1.0 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.1 Found AVX2 Found AVX

Re: [tesseract-ocr] Extracting alphanumeric identifiers (ISINs)

2022-06-28 Thread 'Stefan Bretzel' via tesseract-ocr
Hi Zdenko, indeed, avoiding OCR completely for this kind of documents would be desirable though not possible for us. As a matter of fact, we already use ISIN validation and do quite a lot of postprocessing on the extracted ISIN as an attempt to compensate for OCR misreadings. In doing so, we us