[tesseract-ocr] Re: Issue with Fine-Tuning eng.traineddata on Large Dataset: Negative Mean RMS Error

2024-01-31 Thread Tom Morris
On Tuesday, January 30, 2024 at 11:13:06 AM UTC-5 Ilyas wrote: The output I'm wondering about is : At iteration 1/600/600, Mean rms=-2147483.6%, delta=0.033%, char train=275.696%, word train=100%, skip ratio=0%, New worst char error = 275.696 wrote checkpoint. I expected the training process

Re: [tesseract-ocr] OCR of free hand photo of book

2024-01-31 Thread Zdenko Podobny
Tesseract is OCR engine and the user is responsible for preprocessing - see the documentation. IMO there is already app (using tesseract) for what you try to do: Text Fairy [1] [1] https://play.google.com/store/apps/details?id=com.renard.ocr&hl=en Zdenko st 31. 1. 2024 o 2:00 Borneq napísal(a

Re: [tesseract-ocr] Re: I need help to develop image to text extraction

2024-01-31 Thread Santhiya C
Already i was used above mentioned steps but i lost the datas On Saturday 27 January 2024 at 06:52:54 UTC+5:30 g...@hobbelt.com wrote: > L.S., > > *PDF. OCR. text extraction. best language models? not a lot of success > yet...* > > 🤔 > > Broad subject. Learning curve ahead. 🚧 Workflow diagra