Re: [tesseract-ocr] Tesseract training for New font/language

2023-10-02 Thread Fish Money
Please share a sample of the image you're trying to recognize.

On Saturday, April 1, 2023 at 10:56:58 UTC-4, ali8a...@gmail.com wrote:
> Is it best to train a new language?
>
> On Saturday, April 1, 2023 at 7:54:30 a.m. UTC-7 shree wrote:
>
>> Aurebesh seems to be different symbols mapped to the English alpha

Re: [tesseract-ocr] Tesseract training for New font/language

2023-04-01 Thread Ali Abedian
Is it best to train a new language?

On Saturday, April 1, 2023 at 7:54:30 a.m. UTC-7 shree wrote:
> Aurebesh seems to be different symbols mapped to the English alphabet
> rather than a new font for English, hence training would need to be for a
> new language rather than just fine-tuning.
>

Re: [tesseract-ocr] Tesseract training for New font/language

2023-04-01 Thread Shree Devi Kumar
Aurebesh seems to be different symbols mapped to the English alphabet rather than a new font for English, hence training would need to be for a new language rather than just fine-tuning.

On Sat, Apr 1, 2023, 10:47 Ali Abedian wrote:
> Hello,
>
> Thank you for providing the references, but I'm st
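
To make the distinction between training a new language and fine-tuning concrete, here is a minimal sketch of how the two cases might differ when invoking the tesstrain Makefile. It assumes the Makefile variables MODEL_NAME, START_MODEL, and MAX_ITERATIONS as documented in the tesstrain README; the model name "aurebesh" and the helper function are hypothetical and do not come from this thread.

```python
import shlex


def tesstrain_command(model_name, start_model=None, max_iterations=10000):
    """Build a `make training` invocation for the tesstrain Makefile.

    Hypothetical helper for illustration only. Leaving start_model unset
    corresponds to training from scratch (a new language); setting it,
    for example to "eng", corresponds to fine-tuning an existing model.
    """
    cmd = ["make", "training", f"MODEL_NAME={model_name}"]
    if start_model is not None:
        cmd.append(f"START_MODEL={start_model}")
    cmd.append(f"MAX_ITERATIONS={max_iterations}")
    return cmd


# New language: no START_MODEL, the network is trained from scratch.
from_scratch = tesstrain_command("aurebesh")

# Fine-tuning: continue from the existing English model.
fine_tune = tesstrain_command("aurebesh", start_model="eng")

print(shlex.join(from_scratch))
print(shlex.join(fine_tune))
```

The printed commands would be run from a tesstrain checkout with the ground-truth data already in place. Training from scratch generally needs far more data and iterations than fine-tuning, so the iteration count shown here is only a placeholder.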

Re: [tesseract-ocr] Tesseract training for New font/language

2023-04-01 Thread Ali Abedian
Hello,

Thank you for providing the references, but I'm still a bit confused. I have trained Tesseract using the same method as described in https://github.com/tesseract-ocr/tesstrain/blob/main/ocrd-testset.zip, with 100,000 sentences and a maximum of 10,000 iterations. However, it still canno

Re: [tesseract-ocr] Tesseract training for New font/language

2023-04-01 Thread Zdenko Podobny
Please have a look at https://github.com/tesseract-ocr/tesstrain (especially https://github.com/tesseract-ocr/tesstrain/blob/main/ocrd-testset.zip).

Zdenko

On Fri, Mar 31, 2023 at 7:03, Ali Abedian wrote:
> Hey everyone! I'm currently working on a personal project where I'm
> training a new font