What is inside your training_text file? I had similar issue when the lines in that file were too long... try to make them as short as 5-7 words (and then break it with a newline)
On Thursday, September 23, 2021 at 10:07:26 AM UTC+3 Meet Yogi wrote: > I'm using command > tesseract tiff_file_path name_of_lstm_file lstm.train > for example > tesseract batch3.tiff batch3 lstm.train > > while doing so I'm getting the following error > Tesseract Open Source OCR Engine v4.1.1 with Leptonica > Page 1 > Warning: Invalid resolution 0 dpi. Using 70 instead. > Estimating resolution as 335 > Empty page!! > Page 2 > Warning: Invalid resolution 0 dpi. Using 70 instead. > Estimating resolution as 316 > *Deserialize header failed:* > /home/petpooja/meet/ocr/tesseract_annotated_images/training_data_phase_1/cn_id_in_gstno_cleaned_images/batch3.lstmf > *Failed to read training data from* > /home/petpooja/meet/ocr/tesseract_annotated_images/training_data_phase_1/cn_id_in_gstno_cleaned_images/batch3.lstmf! > *Error during processing*. > > How can it be solved? > I have checked the box file and verified annotations there is no issue > with it > > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/22aec03c-2b32-48ff-ac97-bf97bc87193dn%40googlegroups.com.