[tesseract-ocr] Re: how to combine (merge) trained data of Tesseract files

2022-09-28 Thread Quan Nguyen
Merging two traineddata files is neither possible nor supported. What you can do is rename your custom language pack to, say, eng1.traineddata and then specify -l eng+eng1 when running the tesseract executable. On Sunday, September 25, 2022 at 4:22:45 PM UTC-5 fishmo...@gmail.com wrote:
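
A minimal sketch of the workaround described above, assuming the custom pack has already been renamed to eng1.traineddata and placed in the tessdata directory. The helper name build_tesseract_command and the file names page.png / page are hypothetical; only the -l and --tessdata-dir options come from the tesseract command line itself.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image, output_base, languages, tessdata_dir=None):
    """Build a tesseract command line that loads several language packs.

    Hypothetical helper: the packs are not merged; tesseract loads each one
    and the names are simply joined with '+' for the -l option.
    """
    if not languages:
        raise ValueError("at least one language pack name is required")
    cmd = ["tesseract", str(image), str(output_base)]
    if tessdata_dir is not None:
        # Optional: point tesseract at the folder holding the .traineddata files.
        cmd += ["--tessdata-dir", str(tessdata_dir)]
    cmd += ["-l", "+".join(languages)]
    return cmd


if __name__ == "__main__":
    # "eng1" stands for the renamed custom pack (eng1.traineddata).
    cmd = build_tesseract_command("page.png", "page", ["eng", "eng1"])
    print(" ".join(cmd))
    # Run only if tesseract is installed and the (hypothetical) image exists.
    if shutil.which("tesseract") and Path("page.png").exists():
        subprocess.run(cmd, check=True)
```

Both packs must be present in the same tessdata directory for this to work; the order of the names in -l may affect which pack takes priority.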

Re: [tesseract-ocr] Re: ErrorInInitializerError - zip file closed out of tess4j.util.LoadLibs.getTesseractLibName

2022-09-28 Thread Quan Nguyen
Try the suggested program to see if the same exception occurs. If it does not, then it's possible that something in your code is not right. VietOCR is open source; you can browse through the code to see how it works. Another suggestion: try using the alternate Tesseract1 API. On Friday, Se

[tesseract-ocr] Re: Tess4j failing near load of shared library tesseract-ocr-5.2 in Java 11 and 17, succeeds in Java 8

2022-09-28 Thread Quan Nguyen
The source of tess4j is available; you can trace through the code to see what threw the exception. Nevertheless, "throwable while reading PDF" seems to point to the part of the code that reads in the PDF file. Was that something you wrote, or from tess4j itself? On Sunday, September 25, 2022 at 11:02:

[tesseract-ocr] Re: Tess4j failing near load of shared library tesseract-ocr-5.2 in Java 11 and 17, succeeds in Java 8

2022-09-28 Thread Quan Nguyen
PDF files are read by the PDFBox library. You may want to look into that area as well. On Wednesday, September 28, 2022 at 10:52:15 PM UTC-5 Quan Nguyen wrote: > The source of tess4j is available; you can trace through the code to see > what threw the exception. > > Nevertheless, "throwable while r