training/lstmtraining --model_output /path/to/output [--max_image_MB 6000] \ --continue_from /path/to/existing/model \ --traineddata /path/to/original/traineddata \ [--perfect_sample_delay 0] [--debug_interval 0] \ [--max_iterations 0] [--target_error_rate 0.01] \ --train_listfile /path/to/list/of/filenames.txt
In this command, what should be passed to the argument *continue_from* and *traineddata*? I'm a bit confused. On Wednesday, 24 October 2018 17:11:16 UTC+5:30, Soumik Ranjan Dasgupta wrote: > > Please see tesseract/src/training/language_specific.sh > You need to add the fonts under the respective category after > installation. > > On Wed, Oct 24, 2018, 5:04 PM <vivek....@teknowmics.co.in <javascript:>> > wrote: > >> 'Add the same in font_properties and language_specific.sh' ? Can you >> please elaborate? Thank you >> >> On Wednesday, 17 October 2018 20:18:26 UTC+5:30, Soumik Ranjan Dasgupta >> wrote: >>> >>> You'll need to install the fonts in your system add the same in >>> font_properties and language_specific.sh for fine-tuning or training from >>> scratch. For further details please see >>> https://github.com/tesseract-ocr/tesseract/issues/1672. >>> >>> On Tue, Oct 16, 2018 at 3:40 PM kislay bajpai <kislay....@gmail.com> >>> wrote: >>> >>>> Hello, >>>> >>>> Thanks for prompt reply, I want to train tesseract 4.0 alpha for font >>>> E13B. How could i train? Please share the knowledge. >>>> >>>> On Tuesday, October 16, 2018 at 1:57:17 PM UTC+5:30, Soumik Ranjan >>>> Dasgupta wrote: >>>>> >>>>> Please see >>>>> https://github.com/tesseract-ocr/tesseract/wiki/Fonts#fonts-for-tesseract-training >>>>> . >>>>> >>>>> On Tue, Oct 16, 2018 at 1:49 PM <kislay...@imageinfosystems.com> >>>>> wrote: >>>>> >>>>>> Hello all, >>>>>> >>>>>> I want to train tesseract 4.0 alpha for a new font, is there anyone >>>>>> who can help me on this topic. >>>>>> >>>>>> On Monday, May 14, 2018 at 7:15:15 PM UTC+5:30, reza wrote: >>>>>>> >>>>>>> hi >>>>>>> i tested tesseract 4 beta on persian lang , the results was good. >>>>>>> but i think needs more training on more fonts and texts. >>>>>>> how could we train more fonts and texts on model that exist in >>>>>>> tesseract 4 beta for persian lang ? >>>>>>> >>>>>>> and last question is, how could we apply dictionary to correct that >>>>>>> words OCRing with error ? >>>>>>> >>>>>>> thanks >>>>>>> >>>>>> -- >>>>>> You received this message because you are subscribed to the Google >>>>>> Groups "tesseract-ocr" group. >>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>> send an email to tesseract-oc...@googlegroups.com. >>>>>> To post to this group, send email to tesser...@googlegroups.com. >>>>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>>>> To view this discussion on the web visit >>>>>> https://groups.google.com/d/msgid/tesseract-ocr/1ee9528e-d8fd-4438-9cd0-4925ae7763d5%40googlegroups.com >>>>>> >>>>>> <https://groups.google.com/d/msgid/tesseract-ocr/1ee9528e-d8fd-4438-9cd0-4925ae7763d5%40googlegroups.com?utm_medium=email&utm_source=footer> >>>>>> . >>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>> >>>>> >>>>> >>>>> -- >>>>> Regards, >>>>> Soumik Ranjan Dasgupta >>>>> >>>> -- >>>> You received this message because you are subscribed to the Google >>>> Groups "tesseract-ocr" group. >>>> To unsubscribe from this group and stop receiving emails from it, send >>>> an email to tesseract-oc...@googlegroups.com. >>>> To post to this group, send email to tesser...@googlegroups.com. >>>> Visit this group at https://groups.google.com/group/tesseract-ocr. >>>> To view this discussion on the web visit >>>> https://groups.google.com/d/msgid/tesseract-ocr/72b70562-15f4-4b6f-96a9-62b6d792980c%40googlegroups.com >>>> >>>> <https://groups.google.com/d/msgid/tesseract-ocr/72b70562-15f4-4b6f-96a9-62b6d792980c%40googlegroups.com?utm_medium=email&utm_source=footer> >>>> . >>>> For more options, visit https://groups.google.com/d/optout. >>>> >>> >>> >>> -- >>> Regards, >>> Soumik Ranjan Dasgupta >>> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to tesseract-oc...@googlegroups.com <javascript:>. >> To post to this group, send email to tesser...@googlegroups.com >> <javascript:>. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/8eafa0fa-6129-4c87-a53b-ae8a5659ae79%40googlegroups.com >> >> <https://groups.google.com/d/msgid/tesseract-ocr/8eafa0fa-6129-4c87-a53b-ae8a5659ae79%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email to tesseract-ocr@googlegroups.com. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/d374762e-28e2-4118-847f-edec3065b3a8%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.