The tutorial has been written by Ray Smith. I haven't tested the plus-minus
as given.

Check whether the fonts you are using have the plus-minus sign.

Using one font is for the IMPACT tutorial with 400 iterations.

For plus-minus you need to use the larger list of fonts.

On Sat, Jun 23, 2018 at 1:13 AM Harathi Surya <harathisur...@gmail.com>
wrote:

> Sorry by mistake uploaded the wrong file. Please find the attached file
> for the output i got.
>
> Thanks,
> Harathi
>
> On Friday, June 22, 2018 at 12:41:25 PM UTC-7, Harathi Surya wrote:
>>
>> Thanks Shree,
>>
>> I followed the instructions and ran the following command:
>>
>> src/training/lstmtraining --stop_training   --continue_from
>> ~/tesstutorial/trainplusminus/plusminus_checkpoint   --traineddata
>> ~/tesstutorial/trainplusminus/eng/eng.traineddata   --model_output
>> ~/tesstutorial/trainplusminus/eng.traineddata
>>
>> Then i changed the TESSDATA_PREFIX to '/tesstutorial/trainplusminus'.
>> Then i tested the model with the image i attached in the previous email.
>> The output is little changed. But didnt get expected. '±' symbol is
>> replaced by '+' symbol. Please find the attached output file.
>> Training for more epochs may improve this?
>>
>> Thanks,
>> Harathi
>>
>> On Thursday, June 21, 2018 at 8:50:14 PM UTC-7, Harathi Surya wrote:
>>>
>>> Hi,
>>>
>>> I am trying to create .lstm files to finetune tesseract4.0.0 for new
>>> characters. I want to fine tune tesseract to recognize new characters like
>>> ±.
>>> What i tried:
>>> I added text that consists of the plus or minus symbol to the
>>> eng.training_text in langdata.
>>> Then I tried to run the following command
>>>
>>> src/training/tesstrain.sh --fonts_dir /usr/share/fonts --lang eng
>>> --linedata_only --noextract_font_properties --langdata_dir ../langdata
>>>  --tessdata_dir ./tessdata --output_dir ~/tesstutorial/trainplusminus
>>>
>>> I am getting the following error:
>>> ERROR: /tmp/tmp.3qWucNlYrH/eng/eng.Arial.exp0.box does not exist or is
>>> not readable
>>>
>>> The error repeated for all the font types.
>>>
>>> Can you please give some suggestions why this error occurs and how to
>>> solve this?
>>>
>>> Thanks in advance
>>> Harathi
>>>
>> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/18d951f5-7ef4-4f2f-9faf-9b1233c6c325%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/18d951f5-7ef4-4f2f-9faf-9b1233c6c325%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>


-- 

____________________________________________________________
भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduWzNORdeVhz8POhBj3AF4zKFLpe3urXQN9zvKzbhDAspA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to