[tesseract-ocr] HRe: tesseract 4.1.1 slow in aws instance centos7

2022-11-28 Thread Giuseppe Coniglio
Hi, I have the same problem on my Oracle Linux Server 8.6: tesseract 4.1.1 leptonica-1.76.0 libgif 5.1.4 : libjpeg 6b (libjpeg-turbo 1.5.3) : libpng 1.6.34 : libtiff 4.0.9 : zlib 1.2.11 : libwebp 1.0.0 Found AVX2 Found AVX Found FMA Found SSE On Wednesday, 9 November 2022 at 15:31:50

[tesseract-ocr] Extract Text From A Scanned PDF Using OCR In Java: low elaboration in a Oracle Linux Server 8.6

2022-11-28 Thread Giuseppe Coniglio
Hi all, I have implemented a Spring Boot microservice that uses tess4j 4.3.1 and pdfbox 2.0.22 on my Oracle Linux Server; example code: https://colwil.com/how-to-extract-text-from-a-scanned-pdf-using-ocr-in-java/ When I run the code from my IDE on a Windows PC and invoke the local service,

Re: [tesseract-ocr] HRe: tesseract 4.1.1 slow in aws instance centos7

2022-11-29 Thread Giuseppe Coniglio
is unsupported) > > Zdenko > > > On Mon, 28 Nov 2022 at 15:49, Giuseppe Coniglio wrote: > >> Hi, I have same problem in my Oracle Linux Server 8.6 >> >> tesseract 4.1.1 >> leptonica-1.76.0 >> libgif 5.1.4 : libjpeg 6b (libjpeg-turbo 1.5.3) : l

[tesseract-ocr] Re: Extract Text From A Scanned PDF Using OCR In Java: low elaboration in a Oracle Linux Server 8.6

2022-11-29 Thread Giuseppe Coniglio
The code is: https://medium.com/gft-engineering/creating-an-ocr-microservice-using-tesseract-pdfbox-and-docker-155beb7f2623 Have a nice day. On Monday, 28 November 2022 at 15:50:10 UTC+1, Giuseppe Coniglio wrote: > Hi to all, > I have implemented a Spring boot microservice whi

Re: [tesseract-ocr] Can't get bib#'s from tshirt JPG: should be simple.

2022-11-30 Thread Giuseppe Coniglio
Hi Tim, the code is below: Tesseract tesseract = new Tesseract(); try { tesseract.setDatapath("tessdata"); // the path of your tess data folder inside the extracted file String text = tesseract.doOCR(new File("c:\\temp\\TSP_12484529.JPG")); // path of your image file System.out.print(text); } catch (T

[tesseract-ocr] File tesseract.log is empty in Linux Server

2023-04-17 Thread Giuseppe Coniglio
Hi all :-) I developed a Spring Boot microservice that uses Tess4J 4.3.1 and PDFBox 2.0.22 (Maven dependencies in pom.xml). The Linux server has Tesseract 4.1.1 installed (leptonica-1.76.0). The Java project is based on this example: https://colwil.com/how-to-extract-text-from-a-scanned-pdf-using-ocr-i