The tutorial has been written by Ray Smith. I haven't tested the plus-minus as given.
Check whether the fonts you are using have the plus-minus sign. Using one font is for the IMPACT tutorial with 400 iterations. For plus-minus you need to use the larger list of fonts. On Sat, Jun 23, 2018 at 1:13 AM Harathi Surya <harathisur...@gmail.com> wrote: > Sorry by mistake uploaded the wrong file. Please find the attached file > for the output i got. > > Thanks, > Harathi > > On Friday, June 22, 2018 at 12:41:25 PM UTC-7, Harathi Surya wrote: >> >> Thanks Shree, >> >> I followed the instructions and ran the following command: >> >> src/training/lstmtraining --stop_training --continue_from >> ~/tesstutorial/trainplusminus/plusminus_checkpoint --traineddata >> ~/tesstutorial/trainplusminus/eng/eng.traineddata --model_output >> ~/tesstutorial/trainplusminus/eng.traineddata >> >> Then i changed the TESSDATA_PREFIX to '/tesstutorial/trainplusminus'. >> Then i tested the model with the image i attached in the previous email. >> The output is little changed. But didnt get expected. '±' symbol is >> replaced by '+' symbol. Please find the attached output file. >> Training for more epochs may improve this? >> >> Thanks, >> Harathi >> >> On Thursday, June 21, 2018 at 8:50:14 PM UTC-7, Harathi Surya wrote: >>> >>> Hi, >>> >>> I am trying to create .lstm files to finetune tesseract4.0.0 for new >>> characters. I want to fine tune tesseract to recognize new characters like >>> ±. >>> What i tried: >>> I added text that consists of the plus or minus symbol to the >>> eng.training_text in langdata. >>> Then I tried to run the following command >>> >>> src/training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng >>> --linedata_only --noextract_font_properties --langdata_dir ../langdata >>> --tessdata_dir ./tessdata --output_dir ~/tesstutorial/trainplusminus >>> >>> I am getting the following error: >>> ERROR: /tmp/tmp.3qWucNlYrH/eng/eng.Arial.exp0.box does not exist or is >>> not readable >>> >>> The error repeated for all the font types. >>> >>> Can you please give some suggestions why this error occurs and how to >>> solve this? >>> >>> Thanks in advance >>> Harathi >>> >> -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to tesseract-ocr+unsubscr...@googlegroups.com. > To post to this group, send email to tesseract-ocr@googlegroups.com. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/18d951f5-7ef4-4f2f-9faf-9b1233c6c325%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/18d951f5-7ef4-4f2f-9faf-9b1233c6c325%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWzNORdeVhz8POhBj3AF4zKFLpe3urXQN9zvKzbhDAspA%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.