Hi,

Please find the details below:

1. Dataset: It's available as a zip archive of Marathi handwritten text here:
https://github.com/codeatpanorama/training-data/blob/main/marathi_handwritten_text.zip
2. Log: the Tesstrain.log file is in this folder:
https://github.com/codeatpanorama/training-data/tree/main/logs
3. Command used:
nohup make training MODEL_NAME=mar_hw START_MODEL=mar TESSDATA=tessdata_best MAX_ITERATIONS=10000 LANG_TYPE=Indic > plot/TESSTRAIN.LOG &
4. We downloaded mar.traineddata with this command: wget 
https://github.com/tesseract-ocr/tessdata/raw/main/mar.traineddata 
<https://github.com/tesseract-ocr/tessdata/raw/main/eng.traineddata> -P 
tessdata_best
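
As a sanity check on items 3 and 4, here is a minimal sketch of how we could confirm the start model is where the training command expects it. It assumes tesstrain looks for the file at TESSDATA/START_MODEL.traineddata, which is our reading and not something we have verified; check_start_model is a hypothetical helper of ours, not a tesstrain command.

```shell
#!/bin/sh
# Hypothetical helper (not part of tesstrain): confirm that the start model
# file is present and non-empty in the TESSDATA directory.
# Assumption: the expected path is <TESSDATA>/<START_MODEL>.traineddata.
check_start_model() {
    tessdata_dir=$1
    start_model=$2
    model_file="$tessdata_dir/$start_model.traineddata"
    if [ -s "$model_file" ]; then
        echo "found: $model_file"
        return 0
    fi
    echo "missing or empty: $model_file" >&2
    return 1
}

# Values taken from the training command above.
check_start_model tessdata_best mar || echo "start model check failed"
```

Run from the tesstrain directory, this only reports whether the downloaded file exists and is non-empty; it does not tell us whether the model is suitable as a base for training.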

This is the output of tesseract --version:

tesseract 4.1.1
 leptonica-1.79.0
  libgif 5.1.4 : libjpeg 8d (libjpeg-turbo 2.0.3) : libpng 1.6.37 : libtiff 4.1.0 : zlib 1.2.11 : libwebp 0.6.1 : libopenjp2 2.3.1
 Found libarchive 3.4.0 zlib/1.2.11 liblzma/5.2.4 bz2lib/1.0.8 liblz4/1.9.2 libzstd/1.4.4

Please help us get unblocked here.

Also, when we omit the START_MODEL flag and provide only the TESSDATA 
path, which base model does Tesseract use for training?

Thanks!
