*perfect!* Thank you very much <3 Thats what i was looking for. International Alphabet of Sanskrit Transliteration Characters.
Can tell me in which folder i must place the .traineddata? My configuration: tesseract 4.1.1 leptonica-1.79.0 libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 2.0.3) : libpng 1.6.37 : libtiff 4.1.0 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.1 Found AVX Found SSE Found libarchive 3.4.0 zlib/1.2.11 liblzma/5.2.4 bz2lib/1.0.8 liblz4/1.9.2 libzstd/1.4.4 Many thanks again for your fast help On Saturday, October 24, 2020 at 3:12:15 PM UTC+2 shree wrote: > Ray has suggested using plus-minus type of training for adding a couple of > characters to the traineddata. Did you try that? > > Please share the training data you used (box/tiff pairs or lstmf files). > > I have done replace a layer training for Sanskrit. It adds the two > characters you want (in addition to many other required for Sanskrit > transliteration) . See sample image and attached output. The file is > available at > https://github.com/Shreeshrii/tess5training-sanskrit-iast/tree/main/tessdata/fast > > > > On Sat, Oct 24, 2020 at 5:31 PM Timo Struppi <mac...@gmail.com> wrote: > >> >> Hello, >> >> I dont want to invent the wheel new by creating a new language but how do >> i add the letters ṛ and ī to the OCR?? >> >> I tried a lot (vietOCR, Linux inteligent OCR solution, followed the few >> avaible tutorials etc) for several days but i am still not achieve to add a >> single letter. >> >> >> Many thanks in advance >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-oc...@googlegroups.com. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/f23a9be3-dea4-46a6-8e21-dbe9c120d993n%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/f23a9be3-dea4-46a6-8e21-dbe9c120d993n%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> > > > -- > > ____________________________________________________________ > भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/ad7dcc6f-1232-4547-b3cb-f212eb1fdcf4n%40googlegroups.com.