Help! I get the following error. What am I doing wrong?

Error opening data file /usr/share/tesseract-ocr/4.00/tessdata/Sanskrit-1017-fast.traineddata
Please make sure the TESSDATA_PREFIX environment variable is set to your "tessdata" directory.
Failed loading language 'Sanskrit-1017-fast'
Tesseract couldn't load any languages!
Could not initialize tesseract.
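For anyone reproducing this: the error message names the exact file Tesseract tried to open, so the first thing to check is whether that file exists. Below is a minimal sketch of that check in Python. The helper names (traineddata_path, check_language) are made up for illustration, the fallback directory is simply the one printed in the error above, and the lookup order (explicit directory, then TESSDATA_PREFIX, then the default) is an assumption about Tesseract 4.x rather than something confirmed in this thread.

```python
import os
from pathlib import Path
from typing import Optional

# Assumption: the default directory is the one shown in the error message above.
DEFAULT_TESSDATA_DIR = "/usr/share/tesseract-ocr/4.00/tessdata"


def traineddata_path(lang: str, tessdata_dir: Optional[str] = None) -> Path:
    """Build the path of the <lang>.traineddata file to look for.

    Hypothetical helper: prefers an explicit directory, then the
    TESSDATA_PREFIX environment variable, then the default above.
    """
    base = tessdata_dir or os.environ.get("TESSDATA_PREFIX") or DEFAULT_TESSDATA_DIR
    return Path(base) / f"{lang}.traineddata"


def check_language(lang: str, tessdata_dir: Optional[str] = None) -> bool:
    """Report whether the traineddata file for `lang` exists where expected."""
    path = traineddata_path(lang, tessdata_dir)
    found = path.is_file()
    print(f"{path}: {'found' if found else 'MISSING'}")
    return found


if __name__ == "__main__":
    # Language name taken from the error message; it must match the file name
    # (without the .traineddata extension) exactly.
    check_language("Sanskrit-1017-fast")
```

If the file is reported as missing, either copy the downloaded .traineddata into that directory or point Tesseract at the folder that holds it, for example with the --tessdata-dir command-line option. Running tesseract --list-langs should then show the language name.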

On Saturday, October 24, 2020 at 5:53:55 PM UTC+2 Timo Struppi wrote:

> *perfect!* Thank you very much <3 That's what I was looking for.
> International Alphabet of Sanskrit Transliteration Characters.
>
> Can you tell me in which folder I must place the .traineddata?
>
> My configuration:
> tesseract 4.1.1
> leptonica-1.79.0
> libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 2.0.3) : libpng 1.6.37 :
> libtiff 4.1.0 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.1
> Found AVX
> Found SSE
> Found libarchive 3.4.0 zlib/1.2.11 liblzma/5.2.4 bz2lib/1.0.8
> liblz4/1.9.2 libzstd/1.4.4
>
> Many thanks again for your fast help
>
> On Saturday, October 24, 2020 at 3:12:15 PM UTC+2 shree wrote:
>
>> Ray has suggested using plus-minus type of training for adding a couple
>> of characters to the traineddata. Did you try that?
>>
>> Please share the training data you used (box/tiff pairs or lstmf files).
>>
>> I have done replace-a-layer training for Sanskrit. It adds the two
>> characters you want (in addition to many others required for Sanskrit
>> transliteration). See the sample image and attached output. The file is
>> available at
>> https://github.com/Shreeshrii/tess5training-sanskrit-iast/tree/main/tessdata/fast
>>
>> On Sat, Oct 24, 2020 at 5:31 PM Timo Struppi <mac...@gmail.com> wrote:
>>
>>> Hello,
>>>
>>> I don't want to reinvent the wheel by creating a new language, but how
>>> do I add the letters ṛ and ī to the OCR?
>>>
>>> I tried a lot (VietOCR, Linux Intelligent OCR Solution, followed the few
>>> available tutorials, etc.) for several days, but I still have not managed
>>> to add a single letter.
>>>
>>> Many thanks in advance
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to tesseract-oc...@googlegroups.com.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/tesseract-ocr/f23a9be3-dea4-46a6-8e21-dbe9c120d993n%40googlegroups.com
>>
>> --
>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/bc9993ae-9c0e-4b8b-8783-3464ac2278bdn%40googlegroups.com.