Quite a few of these handwriting fonts are uppercase letters only (so 
lowercase come out as uppercase when typed) . What is the best type of 
[lang].training_text data to use for training these - is it uppercase only?

On Thursday, June 21, 2018 at 10:24:11 AM UTC+1, shree wrote:
>
> I had tried training with the handwriting font you mentioned in first 
> message. 
>
> I think that font has same shapes for capitals as well as lower case 
> letters.
>
> So recognition rates will be lower for it.
>
> On Thu 21 Jun, 2018, 1:49 PM Navaneetha Bitla, <neeth...@gmail.com 
> <javascript:>> wrote:
>
>> yeah i've tried to train with these images but its giving dpi etc error. 
>>
>> Then i've moved to ttf font then converted ttf to tiff finally trained 
>> the data but output is very bad, i dont know whether bad results for 
>> training process or dataser.
>>
>> Still trying to make progress.
>>
>> On Thu, Jun 21, 2018 at 12:24 PM, chandra churh chatterjee <
>> chandrachurh...@gmail.com <javascript:>> wrote:
>>
>>> Excuse me @Shree Devi Kumar can you please tell me whether data for 
>>> training tesseract 4.0 would be better if the data has images which have 
>>> paragraphed hand written texts 
>>> or single character based texts as follows
>>>
>>> On Wed, Jun 20, 2018 at 9:00 PM Shree Devi Kumar <shree...@gmail.com 
>>> <javascript:>> wrote:
>>>
>>>> You will have better control on training if you use tesstrain.sh 
>>>> provided with tesseract.
>>>>
>>>> On Wed, Jun 20, 2018 at 8:52 PM Navaneetha Bitla <neeth...@gmail.com 
>>>> <javascript:>> wrote:
>>>>
>>>>> http://www.1001fonts.com/handwritten-fonts.html.
>>>>>
>>>>> the above link has 1900+ fonts from that site i have downloaded the 
>>>>> ttf files of fonts and converted to tiff files online.
>>>>>
>>>>> then i have trained the tiff files(fonts) using serak trainer.
>>>>>
>>>>>
>>>>> If you got the accuracy just forward the results so everyone can konw 
>>>>> and will follw you.
>>>>>
>>>>> Thank you
>>>>>
>>>>> On Wed, Jun 20, 2018 at 3:13 PM, James Q <james.qu...@taina.tech 
>>>>> <javascript:>> wrote:
>>>>>
>>>>>> I'm going to be using tesseract 4 and using the tesstrain.sh script. 
>>>>>> If I come across things that improve accuracy though I will let you know.
>>>>>>
>>>>>> Where did you find 1300 handwriting fonts?
>>>>>>
>>>>>> On Tuesday, June 19, 2018 at 5:19:54 PM UTC+1, Navaneetha Bitla wrote:
>>>>>>>
>>>>>>> serak trainer using training tesseract 3.5.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Tue, Jun 19, 2018 at 9:29 PM, James Q <james.qu...@taina.tech> 
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Hi Navaneetha
>>>>>>>> I am also looking to start training tesseract using handwritten 
>>>>>>>> fonts and am about to start setting up my training environment. Are 
>>>>>>>> you 
>>>>>>>> training tesseract 4 by following the guide at 
>>>>>>>> https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00 
>>>>>>>> ?
>>>>>>>>
>>>>>>>> If so are you fine tuning the existing english model, retraining 
>>>>>>>> just the top layer(s) or training from scratch with your additional 
>>>>>>>> fonts?
>>>>>>>>
>>>>>>>> Thanks
>>>>>>>> Jim
>>>>>>>>
>>>>>>>> On Tuesday, June 19, 2018 at 10:30:30 AM UTC+1, Navaneetha Bitla 
>>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>> Hi, this is Navaneetha 
>>>>>>>>>
>>>>>>>>> i'm working in hand written character recognition project. 
>>>>>>>>>
>>>>>>>>> I have trained 1300 different hand written fonts of english and 
>>>>>>>>> moved the files into tessdata directory.
>>>>>>>>>
>>>>>>>>> tested tesseract using the below commands:
>>>>>>>>>
>>>>>>>>> $convert -density 300 input.png -depth 8 -strip -background white 
>>>>>>>>> -alpha off out.tiff
>>>>>>>>>
>>>>>>>>>  $tesseract out.tiff eng
>>>>>>>>>
>>>>>>>>> The input.png is of Alanis Handa font and i have trained this font 
>>>>>>>>> but i'm not getting atleast 40% accuracy.
>>>>>>>>>
>>>>>>>>> Can someone help me.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Thanks in advance.
>>>>>>>>>
>>>>>>>> -- 
>>>>>>>> You received this message because you are subscribed to the Google 
>>>>>>>> Groups "tesseract-ocr" group.
>>>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>>>> send an email to tesseract-oc...@googlegroups.com.
>>>>>>>> To post to this group, send email to tesser...@googlegroups.com.
>>>>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>>>>> To view this discussion on the web visit 
>>>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com
>>>>>>>>  
>>>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/253906ac-fedf-4364-ad70-e745b8786c0d%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>>>> .
>>>>>>>>
>>>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>>>
>>>>>>>
>>>>>>> -- 
>>>>>> You received this message because you are subscribed to the Google 
>>>>>> Groups "tesseract-ocr" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>> send an email to tesseract-oc...@googlegroups.com <javascript:>.
>>>>>> To post to this group, send email to tesser...@googlegroups.com 
>>>>>> <javascript:>.
>>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>>> To view this discussion on the web visit 
>>>>>> https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com
>>>>>>  
>>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/29a1bc53-d127-407b-8611-0652821a0707%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>>>> .
>>>>>>
>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>
>>>>>
>>>>> -- 
>>>>> You received this message because you are subscribed to the Google 
>>>>> Groups "tesseract-ocr" group.
>>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>>> an email to tesseract-oc...@googlegroups.com <javascript:>.
>>>>> To post to this group, send email to tesser...@googlegroups.com 
>>>>> <javascript:>.
>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>>> To view this discussion on the web visit 
>>>>> https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com
>>>>>  
>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfEe2r%2BynHHEGfr8_b-x5KOf2yJ1xr%2Be7e1sDCKxqUFXA%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>>>> .
>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>
>>>>
>>>>
>>>> -- 
>>>>
>>>> ____________________________________________________________
>>>> भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com
>>>>
>>>> -- 
>>>> You received this message because you are subscribed to the Google 
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>> an email to tesseract-oc...@googlegroups.com <javascript:>.
>>>> To post to this group, send email to tesser...@googlegroups.com 
>>>> <javascript:>.
>>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit 
>>>> https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com
>>>>  
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduU4w%2BjPakoNOdzq6QyS3nF9rAp9gHSPUkKddioZTXsgyw%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>>> .
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>> -- 
>>> You received this message because you are subscribed to the Google 
>>> Groups "tesseract-ocr" group.
>>> To unsubscribe from this group and stop receiving emails from it, send 
>>> an email to tesseract-oc...@googlegroups.com <javascript:>.
>>> To post to this group, send email to tesser...@googlegroups.com 
>>> <javascript:>.
>>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>>> To view this discussion on the web visit 
>>> https://groups.google.com/d/msgid/tesseract-ocr/CAD_EDkYpHgUU7O%3DRRTP--3-QLbSQntRgbuTeH5vPcW_gStF-zQ%40mail.gmail.com
>>>  
>>> <https://groups.google.com/d/msgid/tesseract-ocr/CAD_EDkYpHgUU7O%3DRRTP--3-QLbSQntRgbuTeH5vPcW_gStF-zQ%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to tesseract-oc...@googlegroups.com <javascript:>.
>> To post to this group, send email to tesser...@googlegroups.com 
>> <javascript:>.
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfCTBnTS-bRXKUNA5Fc5cAuEvZ10gyxVGHFh0%2B02WrcGg%40mail.gmail.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/CABbi8QfCTBnTS-bRXKUNA5Fc5cAuEvZ10gyxVGHFh0%2B02WrcGg%40mail.gmail.com?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/9a5ccd27-4693-476a-b730-4f19293bf380%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to