I am not familiar (yet ;-) ) with android development (OK I was testing build system/portability of tesseract 3.03 based on "tess-two", but IMO training part is more difficult to compile than tesseract library)).
Suggestions for stat-up: - I would start with 3.02 version because 3.03 needs additional external libraries (icu) - point 1. in you list can be done with tesseract library - Makefile.am in training directory is "outline" for building training libraries. If you understand it you will see how to build executable, what are their dependencies etc. - for this topic - (porting to android), dev forum could be better place Zdenko On Wed, Mar 5, 2014 at 11:59 AM, Tushar Makkar <[email protected]>wrote: > Thanks a lot . > Training the data on client is a must for us since we are aiming to make > an entirely different language which will be specific to a particular > individual.Basically we need to use these all commands for making the > training data which we are unable to port and find suitable alternative : > 1) batch.nochop makebox > 2) cntraining > 3) mftraining > 4) shapeclustering > 5) combine > 6) unicharset_extractor > > Could you give any leads regarding porting these commands . > > Thanks in advance > > On Wednesday, March 5, 2014 3:24:36 PM UTC+5:30, zdenop wrote: > >> You need to port all training tools to android. >> >> Generally (my opinion): >> >> 1. Unless you have proof that you MUST do custom training - training >> is wasting of time (nobody was able to create better language data for the >> existing language and common fonts at Google) >> 2. Unless you do not understand training process (probably you will >> need to read the source code) - training is wasting of time >> >> >> >> Zdenko >> >> >> On Wed, Mar 5, 2014 at 9:39 AM, Tushar Makkar <[email protected]>wrote: >> >>> I am using the tess-two (https://github.com/rmtheis/tess-two) library >>> for OCR recognition on Android . I want to create the training data on >>> Android . I have followed https://code.google.com/p/tesseract-ocr/wiki/ >>> TrainingTesseract3 and successfully created training data on linux >>> system . How to do the same on Android using tess-two or any other library >>> ? >>> >>> -- >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To post to this group, send email to [email protected] >>> >>> To unsubscribe from this group, send email to >>> [email protected] >>> >>> For more options, visit this group at >>> http://groups.google.com/group/tesseract-ocr?hl=en >>> >>> --- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> >>> For more options, visit https://groups.google.com/groups/opt_out. >>> >> >> -- > -- > You received this message because you are subscribed to the Google > Groups "tesseract-ocr" group. > To post to this group, send email to [email protected] > To unsubscribe from this group, send email to > [email protected] > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en > > --- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > For more options, visit https://groups.google.com/groups/opt_out. > -- -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en --- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.

