Hi, please find the details below:
1. Dataset: It is available as a zip archive of Marathi handwritten text here:
   https://github.com/codeatpanorama/training-data/blob/main/marathi_handwritten_text.zip

2. Log: The Tesstrain.log file is here:
   https://github.com/codeatpanorama/training-data/tree/main/logs

3. Command used:
   nohup make training MODEL_NAME=mar_hw START_MODEL=mar TESSDATA=tessdata_best MAX_ITERATIONS=10000 LANG_TYPE=Indic > plot/TESSTRAIN.LOG &

4. We downloaded mar.traineddata using this command:
   wget https://github.com/tesseract-ocr/tessdata/raw/main/mar.traineddata -P tessdata_best

This is the Tesseract version output:

   tesseract 4.1.1
    leptonica-1.79.0
     libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 2.0.3) : libpng 1.6.37 : libtiff 4.1.0 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.1
    Found libarchive 3.4.0 zlib/1.2.11 liblzma/5.2.4 bz2lib/1.0.8 liblz4/1.9.2 libzstd/1.4.4

Please help us get unblocked here.

Also, when we omit the START_MODEL flag and provide only the TESSDATA path, which base model does Tesseract use for training?

Thanks!