It's a plain text file, not a trained data file. I put it in the scripts folder, but it didn't work: it returns empty output. The same happens with Arabic (ara) characters: it returns empty output or an incorrect answer.
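One thing that can be ruled out quickly is a misplaced file: the check below only confirms that radical-stroke.txt is a regular file directly inside the folder given as script_dir. This is a minimal Python sketch, not part of Tesseract; the helper name `missing_script_files` and the example path are hypothetical.

```python
from pathlib import Path

# File the thread says must be present in script_dir.
REQUIRED_FILE = "radical-stroke.txt"


def missing_script_files(script_dir, required=(REQUIRED_FILE,)):
    """Return the names in `required` that are not regular files in script_dir."""
    root = Path(script_dir)
    return [name for name in required if not (root / name).is_file()]


if __name__ == "__main__":
    # Hypothetical location; replace with the folder you pass as script_dir.
    missing = missing_script_files("langdata_lstm")
    if missing:
        print("Missing from script_dir:", ", ".join(missing))
    else:
        print("All required files found.")
```

If the file is reported as missing, it is probably sitting in a subfolder (for example inside ara) rather than at the top level of script_dir.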
On 15 May 2020 14:20, "Piyush Chandra" <piyushs...@gmail.com> wrote:

> You need to put the radical stroke file in your script_dir folder.
>
> On Friday, 15 May 2020 14:57:36 UTC+5:30, nourhan magdy wrote:
>>
>> How can I use this text file? I downloaded the ara folder and copied it to
>> my tessdata, but it didn't work.
>>
>> On Friday, September 27, 2019 at 10:01:11 AM UTC+2, shree wrote:
>>>
>>> You are missing
>>> https://github.com/tesseract-ocr/langdata_lstm/blob/master/radical-stroke.txt
>>>
>>> On Fri, Sep 27, 2019 at 12:59 PM Béchir Gmati <bechir...@gmail.com>
>>> wrote:
>>>
>>>> Hi, I get this error when I run the combine-lang-model command
>>>> line. How can I fix it?
>>>> [image: Capture.JPG]
>>>> [image: Capture1.JPG]
>>>> --
>>>> * GMATI Béchir*
>>>> *Engineering Student, Business Intelligence & Big Data*
>>>>
>>>> On Wed, 25 Sep 2019 at 15:44, Mobeen Ali <moby...@gmail.com> wrote:
>>>>
>>>>> *Current Behavior:*
>>>>> I've followed the wiki and the details given in wiki/Training
>>>>> Tesseract - 4.00. There were no errors in the creation of the
>>>>> traineddata file.
>>>>>
>>>>> I wanted to create my own ara_custom.traineddata file specifically to
>>>>> read dates in Arabic, so it has "٠١٢٣٤٥٦٧٨٩" (the numeric characters
>>>>> 0-9 in Arabic) and a "/" forward slash only.
>>>>>
>>>>> *The format for an Arabic date is:*
>>>>> *٢٠١٩/٠٩/٢٥*
>>>>> *yyyy/mm/dd*
>>>>>
>>>>> *My ara.training_text file is:* attached as ara.training_text.txt
>>>>> (for uploading only; otherwise I use the file without the txt extension)
>>>>>
>>>>> *My ara.wordlist file is:* attached as ara.wordlist.txt (for
>>>>> uploading only; otherwise I use the file without the txt extension)
>>>>>
>>>>> *Text in image:* ٢٠٠٩/١١/١٢ *(32.jpg)*
>>>>> *Tesseract reads:* ٢٤٠٩/١١/١٢ *(32.txt)*
>>>>>
>>>>> *Text in image:* ١٩٧٩/٠١/٢٨ *(24.jpg)*
>>>>> *Tesseract reads:* ١٦٩٧٦ //٠١//٧٢٨ *(24.txt)*
>>>>>
>>>>> *Text in image:* ٢٠١٥/١١/٢٢ *(12.jpg)*
>>>>> *Tesseract reads:* ٢٠١٥/١١/٧٢ *(12.txt)*
>>>>>
>>>>> What I observed is that I have an issue in my training_text file. I've
>>>>> attached the file above. Please guide me on this error, as I have
>>>>> failed to find any solution myself.
>>>>>
>>>>> P.S. I've also studied the Hallucination effect described in the
>>>>> wiki and tried to implement it as I understood it, but no luck.
>>>>>
>>>>> --
>>>>> You received this message because you are subscribed to the Google
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>>> an email to tesser...@googlegroups.com.
>>>>> To view this discussion on the web visit
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/6878322a-c591-481b-b0d8-0befd76cbd22%40googlegroups.com
>>>>>
>>>
>>> --
>>>
>>> ____________________________________________________________
>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>
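
For the date format described in the original post (yyyy/mm/dd written with the Arabic-Indic digits and a forward slash), training text lines could be generated rather than typed by hand. The following is a minimal Python sketch under stated assumptions: the function names, the year range, the seed, and the number of lines are all hypothetical choices, not what the original poster used, and nothing here is claimed to fix the misreads reported above.

```python
import random
from datetime import date, timedelta

# Map ASCII digits 0-9 to Arabic-Indic digits (U+0660 through U+0669).
TO_ARABIC_INDIC = {ord(str(i)): chr(0x0660 + i) for i in range(10)}


def arabic_date(d):
    """Format a date as yyyy/mm/dd using Arabic-Indic digits."""
    return d.strftime("%Y/%m/%d").translate(TO_ARABIC_INDIC)


def sample_dates(n, start=date(1950, 1, 1), end=date(2030, 12, 31), seed=0):
    """Return n formatted dates drawn uniformly from [start, end]."""
    rng = random.Random(seed)  # fixed seed so the output is reproducible
    span = (end - start).days
    return [arabic_date(start + timedelta(days=rng.randint(0, span)))
            for _ in range(n)]


if __name__ == "__main__":
    # Print one date per line; redirect to a file to use as training text.
    for line in sample_dates(20):
        print(line)
```

Sampling across the whole date range, instead of listing a few dates, gives every digit a chance to appear in every position of the yyyy/mm/dd pattern.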