[tesseract-ocr] Re: Creating a new language pack for Javanese Script

shree Mon, 23 Apr 2018 01:07:31 -0700

Please see https://github.com/tesseract-ocr/langdata/issues/126


Replying there.

On Monday, April 23, 2018 at 2:16:06 AM UTC+5:30, Christopher Imantaka 
Halim wrote:
>
> Hi,
>
> I want to develop an OCR for Javanese Script / Aksara.
> https://en.wikipedia.org/wiki/Javanese_script
>
> Plan on using Tesseract version 4.0
> I've read the wiki but somehow got confused.
>
> What do I need to prepare, to start the bare minimum training process? 
> (for Tesseract 4.0)
> In some other thread someone said that training using image files are not 
> supported yet.
> Also found out that box file/tiff pairs are not supported also.
> (I did try making one box file, using this online tool: 
> https://pp19dd.com/tesseract-ocr-chopper/)
>
> Do we have an example of the training "inputs" somewhere on the github 
> projects?
>
> Sorry if this is a stupid question, I'm a newbie. :)
>
> Thanks before
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/352c634a-33d7-43cf-b2eb-58b9385b93a7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[tesseract-ocr] Re: Creating a new language pack for Javanese Script

Reply via email to