Re: [tesseract-ocr] Re: How can I do the training using my own image in Tesseract 4.0

2021-01-09 Thread Kay-Michael Würzner
Of course you can! Just checkout the tesstrain tool: https://github.com/tesseract-ocr/tesstrain Cheers, Kay On Fri, Jan 8, 2021 at 9:32 AM Murtuza Dahodwala wrote: > > > It is now 2 years since this answer was posted. Is it possible to train > tesseract 4 on real images now? > On Thursday, Janu

Re: [tesseract-ocr] Re: How can I do the training using my own image in Tesseract 4.0

2017-02-09 Thread Kay-Michael Würzner
sample. >>> >>> Then modify tesstrain.sh to copy the box tiff pairs to the training >>> directory before starting training >>> >>> >>> >>> mkdir -p ${TRAINING_DIR} >>> tlog "\n=== Starting training for language '${LANG_CO

[tesseract-ocr] Re: How can I do the training using my own image in Tesseract 4.0

2017-02-07 Thread Kay-Michael Würzner
+1 for this question. The training documentation for Tesseract 4.0 by now only covers training with font files (synthetic materials). What is missing is information on training with real data (i.e. manually aligned ground truth). Any hints on that matter are greatly appreciated. Cheers, Kay On

Re: [tesseract-ocr] Tesseract 4.0: VGSLSpecs

2016-12-16 Thread Kay-Michael Würzner
the whole process work? Cheers, Kay On Friday, December 16, 2016 at 5:36:02 PM UTC+1, shree wrote: > > Did you try out the commands as per the LSTM training tutorial? > > On 16-Dec-2016 8:31 PM, "Kay-Michael Würzner" > wrote: > >> Dear @, >> >> I

[tesseract-ocr] Tesseract 4.0: VGSLSpecs

2016-12-16 Thread Kay-Michael Würzner
Dear @, I played around with training the new LSTM mode. According to the documentation of the network specification (https://github.com/tesseract-ocr/tesseract/wiki/VGSLSpecs), the last number in the first tupel called 'depth' corresponds to the type of input (i.e. 1 ... grayscale, 3 ... colo