Re: [tesseract-ocr] tessract usage

2024-01-01 Thread Zdenko Podobny
Did you check license? https://github.com/tesseract-ocr/tesseract/blob/main/LICENSE Zdenko st 27. 12. 2023 o 17:56 Ajay Bhosle napísal(a): > Can i use tesseract to extract text from pdf for commercial use? > > -- > You received this message because you are subscribed to the Google Groups > "t

Re: [tesseract-ocr] Failed to load list of training filenames from data/foo/list.train

2024-01-01 Thread Zdenko Podobny
Follow https://github.com/tesseract-ocr/tesstrain/blob/main/README.md Tesseract OCR 3.05.02 was released 6 years ago... Zdenko so 30. 12. 2023 o 18:24 Omar Samir napísal(a): > I was trying to train Tesseract-OCR on the ocrd-testset.zip in the README, > and I get this error above in the subject

Re: [tesseract-ocr] Phantom characters

2024-01-01 Thread Zdenko Podobny
post: 1. Original image (without preprocessing) 2. + image used for OCR (preprocessed) 3. + output from tesseract executable (not tesseract wrappers) and used parameters/option Otherwise, nobody can reproduce the problem and therefore suggest a solution. Zdenko ne 31. 12. 2023 o 10