Hey Yasin,
Sorry to reply so late. As far as I know, Tesseract doesn't work on MacOs
yet. Maybe you can install a Linux environment inside a VM and make-do with
it?
No, You don't have to create box files manually, tesstrain.sh will do that
for you. In fact, it will take care of the entire training procedure.
If you want to fine-tune
<https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#fine-tuning-for--a-few-characters>,
you have to specify the modified architecture in the VGSL specifications as
the CLI parameter
<https://github.com/tesseract-ocr/tesseract/wiki/VGSLSpecs>.
In order to train Tesseract on a custom fontslist, you'd have to install
them and then mention the names in two separate files - the font_properties
<https://github.com/tesseract-ocr/tesseract/wiki/Training-Tesseract-3.03%E2%80%933.05#the-font_properties-file>
file, and the language-specific.sh
<https://github.com/tesseract-ocr/tesseract/blob/master/src/training/language-specific.sh\>
file. Note that in both files, you need to enlist the fonts in a particular
format.
The traineddata for tesseract 3 is not compatible with the version 4, so
it's better if you train from scratch
<https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#training-from-scratch>
.
Do get back to me if you have any more queries.

On Sat, Aug 25, 2018 at 3:00 PM Yasin Nazlıcan <yasin.nazli...@gmail.com>
wrote:

> Hey Soumik Ranjan,
>
> Thank you for reply, mate. Like I said, I tried the follow this
> documentation
> <https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract-4.00#hardware-software-requirements>,
> but I couldn't go further. I couldn't find any info about macOS and had to
> stop. I assume I should create boxes for font and text and make
> fine-tuning. Do you have any links for macOS, that I can follow? Also, if
> you don't mind could you give me some more explanation about the process?
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/2104120f-c23e-4959-8987-abbf30102ddf%40googlegroups.com
> <https://groups.google.com/d/msgid/tesseract-ocr/2104120f-c23e-4959-8987-abbf30102ddf%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>


-- 
Regards,
Soumik Ranjan Dasgupta

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAB_aDAf2--nfM4HYdDP%3Dvbz7crBjcAWZOjgzoYnoZN0b5UTv1Q%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to