<https://stackoverflow.com/posts/79526256/timeline>

I’ve been following this tutorial from YouTube: Guide to Tesseract Training 
https://www.youtube.com/watch?v=KE4xEzFGSU8&t=13s and its corresponding 
GitHub repository: astutejoe/tesseract_tutorial. 
https://github.com/astutejoe/tesseract_tutorial

The tutorial walks through the process of training a custom Tesseract 
model, but I've run into an issue when trying to continue training the model

*What we tried*: Setup: I followed the steps in the tutorial to set up the 
environment, downloaded the necessary files, and began the training process 
using the base eng.traineddata model.

*Training Command*: After preparing the training data and ground truth, I 
ran the following command to initiate the training:
make training MODEL_NAME=Apex START_MODEL=eng 
TESSDATA=../tesseract/tessdata MAX_ITERATIONS=100 

*Model Generation*: This command successfully generated the Apex.lstm model 
file. However, I encountered an issue when trying to use the Apex.lstm file 
for further training.

*Error:* When attempting to continue training the model, I received the 
following error:Error, data/eng/Apex.lstm is an integer (fast) model, 
cannot continue training

**What we faced:**I have verified that the eng.traineddata file is located 
correctly in /usr/share/tesseract-ocr/5/tessdata/ (path may differ 
depending on installation).Despite following the tutorial and using the 
correct paths for the eng.traineddata,

I’m getting an error related to the model being an "integer model" and 
unable to continue training.I tried downloading the latest eng.traineddata 
from GitHub, but the error persists.

*Questions*: What does the "integer (fast) model" error mean, and how can I 
resolve it? Is there something I missed in the training process that would 
allow me to continue training Apex.lstm? Any advice or insights would be 
greatly appreciated. *Environment*: Tesseract version: 5.3.0 OS: Ubuntu 
20.04 (MacBook Pro) Tesseract Data Path: 
/usr/share/tesseract-ocr/5/tessdata/Base Model: eng.traineddata Makefile: 
https://github.com/tesseract-ocr/tesstrain/blob/43ff10012af31914bb5b72304d9c21c8fdf4f464/Makefile

Thank you in advance for your help!

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion visit 
https://groups.google.com/d/msgid/tesseract-ocr/d09b45da-1e8a-4194-ad28-505857f0ad54n%40googlegroups.com.

Reply via email to