boxtiff file for arabic

2011-04-20 Thread Haydar
Can you please upload the boxtiff file you used for arabic? Thanks... -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to tesseract-ocr@googlegroups.com To unsubscribe from this group, send email to tesseract-oc

Re: boxtiff file for arabic

2011-04-20 Thread Haydar
Ok, sorry for the inconvenience... I read that. However, when I run tesseract with "-l ara", it does produce some output, and there is an ara.traineddata file. So there must be some box files and tiff files behind it, which is what I am asking about... Thanks...

Re: boxtiff file for arabic

2011-04-22 Thread Haydar
more question, is there a guide I can consult for preparing the font_properties file? I don't know some of the fonts' properties. Thanks... -haydar
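For reference, the TrainingTesseract3 wiki describes font_properties as one line per font: the font name followed by five 0/1 flags in the order italic, bold, fixed, serif, fraktur. A minimal Python sketch of that layout follows; the helper name `font_properties_line` and the font names are hypothetical, and the flag order should be checked against the wiki for the Tesseract version in use.

```python
def font_properties_line(font_name, italic=False, bold=False,
                         fixed=False, serif=False, fraktur=False):
    """Build one font_properties line: <fontname> <italic> <bold> <fixed> <serif> <fraktur>.

    Assumption: the name must not contain whitespace, since fields are
    space-separated.
    """
    if not font_name or any(ch.isspace() for ch in font_name):
        raise ValueError("font name must be non-empty and contain no whitespace")
    flags = (italic, bold, fixed, serif, fraktur)
    return " ".join([font_name] + [str(int(flag)) for flag in flags])


# Hypothetical fonts, for illustration only.
lines = [
    font_properties_line("arial"),
    font_properties_line("timesitalic", italic=True, serif=True),
]
font_properties_text = "\n".join(lines) + "\n"
```

A property that is unknown can be looked up in a font viewer; the sketch simply defaults every flag to 0.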

How to read/write from right to left?

2011-05-11 Thread Haydar
Hi, I know that tesseract does not support right-to-left languages yet. But I would like to ask if there is a way to make tesseract read the text-image or write the output from right-to-left. Thanks in advance... Regards, --haydar

Problem with unicharambigs file

2011-05-25 Thread Haydar
decided to create an arb.unicharambigs file which contains something like this: v1 1 wrong string 1 right string1 Then I tried again, but the result doesn't seem to change. Is there anything I am missing? Thanks in advance... -haydar
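As far as the training wiki describes it, a version-1 unicharambigs file starts with a `v1` line, and each following rule has five tab-separated fields: the number of characters in the wrong sequence, those characters separated by spaces, the number of characters in the replacement, those characters separated by spaces, and a type flag (1 = mandatory substitution, 0 = optional). The Python sketch below writes that layout. The helper names are hypothetical, and it assumes each Python code point corresponds to one entry in the unicharset, which may not hold for a given Arabic training set.

```python
def ambig_line(wrong, right, mandatory=True):
    """Format one v1 unicharambigs rule (fields separated by tabs).

    Assumption: every code point of `wrong` and `right` is a single
    unichar in the trained unicharset.
    """
    fields = [
        str(len(wrong)),
        " ".join(wrong),
        str(len(right)),
        " ".join(right),
        "1" if mandatory else "0",
    ]
    return "\t".join(fields)


def unicharambigs_text(rules):
    """Join rules under the 'v1' header line."""
    return "v1\n" + "\n".join(ambig_line(*rule) for rule in rules) + "\n"


# Illustrative rule only: treat the pair "rn" as a possible misreading of "m".
text = unicharambigs_text([("rn", "m", False)])
```

Two things may be worth checking when the file seems to have no effect: that the fields really are separated by tabs, and that the file was combined into the .traineddata file under the same language prefix as the other training files.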

text-direction problem after training

2011-06-24 Thread Haydar
e the reason of this? How can I solve that issue? Thanks... -haydar

Re: Regarding Tesseract 3.0 training

2011-06-24 Thread Haydar
ds-dawg) - I found nearly 4000 frequently used words for English (eng.freq_word_list -> eng.freq-dawg) - Then I followed the procedure from the link http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 and that's it. Hope it helps... -haydar On Jun 24, 7:14 am, Sandeep Parmar
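The post does not say how the frequent-word list was gathered. One possible way, sketched here in Python under that caveat, is to count words in a plain-text corpus and keep the most common ones, one per line, before converting the list with Tesseract's wordlist2dawg tool (its arguments differ between versions, so check the training wiki). The function name, the tokenizing regex, and the 4000-word default are assumptions for illustration.

```python
import re
from collections import Counter


def frequent_words(text, limit=4000):
    """Return up to `limit` words from `text`, most frequent first.

    Words are runs of letters, lowercased; ties are broken alphabetically
    so the result is deterministic.
    """
    words = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter(words)
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [word for word, _ in ranked[:limit]]


sample = "the cat and the dog and the bird"
top_two = frequent_words(sample, limit=2)

# One word per line, the layout expected for a word-list file
# such as eng.freq_word_list.
word_list_text = "\n".join(frequent_words(sample)) + "\n"
```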

Re: Regarding Tesseract 3.0 training

2011-06-25 Thread Haydar
unicharambigs file. Regards, -haydar On Jun 25, 6:46 am, Sandeep Parmar wrote: > Hi Haydar, > > Thanks for replying, but i have the following queries, > >    1. Did your trained data improve the overall accuracy of OCR? >    2. Did your trained data give correct results fo