Re: [tesseract-ocr] Multiple Languages

2018-10-21 Thread MariamHi
I have tried tessdata_fast; the English result is better, but the Arabic result got worse. Result: SimplifiedArabic ?ع? ?ao oe " "? اموه ?جوجا ( ?Google? يستعد لاقتحام أدمغتنا ?الاحد 9 سبتمبر 2018 1 ?الاقتصادية" من الزياض" ?و ابه سب ?هل حدث وأن بحثت عن منتج معين عبر الإنترنت وتفاجأت باقتراحات عدي

Re: [tesseract-ocr] Multiple Languages

2018-10-16 Thread Shree Devi Kumar
> > Please try with tessdata_fast -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To post to this group, send email

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
Yes, I am using the best data. From: Adrian Owen Sent: Tuesday, October 16, 2018 3:44 PM To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages Are you using the best data: https://github.com/tesseract-ocr/tessdata_best ? From: tesseract-ocr@googlegroups.com [mailto:tesseract

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread Adrian Owen
Are you using the best data: https://github.com/tesseract-ocr/tessdata_best ? From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com] On Behalf Of MariamHi Sent: 16 October 2018 13:19 To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages Result
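
For anyone comparing the model sets mentioned in this thread, here is a minimal sketch of pointing the tesseract command line at different traineddata directories with `--tessdata-dir`. The directory paths, the `page.png` file name and the helper name are assumptions for illustration only; the thread does not say where the models were installed or how Tesseract was invoked.

```python
from typing import Dict, List

# Hypothetical local copies of the model repositories; adjust to your setup.
MODEL_DIRS: Dict[str, str] = {
    "best": "/opt/tessdata_best",
    "fast": "/opt/tessdata_fast",
}


def command_for_model(image_path: str, model: str, languages: str = "ara+eng") -> List[str]:
    """Build a tesseract CLI argument list that uses one specific model directory."""
    tessdata_dir = MODEL_DIRS[model]  # raises KeyError for an unknown model name
    return ["tesseract", "--tessdata-dir", tessdata_dir, image_path, "stdout", "-l", languages]


# Print one command per model set so the outputs can be compared side by side.
for name in MODEL_DIRS:
    print(name, " ".join(command_for_model("page.png", name)))
```

Running the same image through both commands makes it easier to see which model set helps which script, as reported above for English versus Arabic.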

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
0658 855108160؛ أن الكثير من خدمات جوجل على أجهزة آيفون 10110106 وآندرويد 8001010 تخزّن بيانات مواقع المستخدمين» حتى وإن قاموا بإيقاف تشغيل خدمات تحديد الموقع الجغرافي بتغيير إعدادات الخصوصية المتوفرة في تلك الأجهزة. From: MariamHi Sent: Tuesday, October 16, 2018 3:17 PM To: tesseract-ocr@goog

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread Adrian Owen
ي بتغيير إعدادات الخصوصية المتوفرة في تلك الأجهزة. From: Adrian Owen Sent: Tuesday, October 16, 2018 1:23 PM To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages try PageSegmentationMode.AU

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
: Adrian Owen Sent: Tuesday, October 16, 2018 1:23 PM To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages Try PageSegmentationMode.AUTO. You may need to enlarge to 300 DPI; what's the original DPI? From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com]

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread Adrian Owen
Try PageSegmentationMode.AUTO. You may need to enlarge to 300 DPI; what's the original DPI? From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com] On Behalf Of MariamHi Sent: 16 October 2018 11:07 To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages
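
As a rough illustration of the "enlarge to 300" advice, the sketch below computes the pixel size an image would need to reach 300 DPI from a known original DPI. The 96 DPI and 800x600 figures are made-up inputs (the original DPI is never stated in the thread), and the actual resampling would still have to be done with an image library or editor of your choice.

```python
from typing import Tuple

TARGET_DPI = 300  # the resolution suggested in the thread


def scaled_size(width_px: int, height_px: int, original_dpi: float,
                target_dpi: float = TARGET_DPI) -> Tuple[int, int]:
    """Return the pixel dimensions needed to bring an image up to target_dpi."""
    if original_dpi <= 0:
        raise ValueError("original_dpi must be positive")
    scale = target_dpi / original_dpi  # e.g. 300 / 96 = 3.125
    return round(width_px * scale), round(height_px * scale)


# Hypothetical 800x600 screenshot captured at 96 DPI.
print(scaled_size(800, 600, 96))  # (2500, 1875)
```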

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread Adrian Owen
Try changing the order: English+Arabic. Any better? From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com] On Behalf Of MariamHi Sent: 16 October 2018 08:27 To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages When I did pre-processing I get
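
Since the order of the languages can affect the result, one way to test the suggestion above is to generate every ordering of the language codes and try each in turn. This is only a sketch: the `eng` and `ara` codes are the usual traineddata names for English and Arabic, and the helper name is hypothetical.

```python
from itertools import permutations
from typing import List, Sequence


def language_orders(languages: Sequence[str]) -> List[str]:
    """Return every ordering of the given language codes, joined with '+'."""
    return ["+".join(order) for order in permutations(languages)]


# Each string can be passed as the language setting for a separate OCR run.
print(language_orders(["ara", "eng"]))  # ['ara+eng', 'eng+ara']
```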

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
: Adrian Owen Sent: Monday, October 15, 2018 3:42 PM To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages GIMP is your friend: https://stackoverflow.com/questions/9480013/image-processing-to-improve-tesseract-ocr-accuracy If you're programming, use the KalikoImage library

RE: [tesseract-ocr] Multiple Languages

2018-10-15 Thread Adrian Owen
) YES: resize to 300 DPI. YES: apply filters. Hope this helps, Adrian From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com] On Behalf Of MariamHi Sent: 15 October 2018 13:38 To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages I did this but I have Bad

RE: [tesseract-ocr] Multiple Languages

2018-10-15 Thread MariamHi
I did this, but I get bad recognition for English words. What is the accuracy for multiple languages, and how can I improve it? From: Adrian Owen Sent: Monday, October 15, 2018 3:35 PM To: tesseract-ocr Subject: Re: [tesseract-ocr] Multiple Languages Just list language codes using the + delimiter. Sent from

Re: [tesseract-ocr] Multiple Languages

2018-10-15 Thread Adrian Owen
Just list language codes using the + delimiter. Original Message Subject: [tesseract-ocr] Multiple Languages From: Mariam Hijazi To: tesseract-ocr CC: Does Tesseract support recognizing multiple languages in one document? And how would I do that? Regards.
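
To make the "+ delimiter" answer concrete, here is a minimal sketch that builds a tesseract command line for several languages at once. It assumes the command-line tool is used (the thread may equally be about a .NET wrapper), and the `page.png` file name and helper name are placeholders.

```python
from typing import List, Sequence


def build_tesseract_command(image_path: str, languages: Sequence[str]) -> List[str]:
    """Return a tesseract CLI argument list that loads several languages together."""
    if not languages:
        raise ValueError("at least one language code is required")
    lang_arg = "+".join(languages)  # e.g. "ara+eng"
    # "stdout" tells tesseract to print the recognized text instead of writing a file.
    return ["tesseract", image_path, "stdout", "-l", lang_arg]


cmd = build_tesseract_command("page.png", ["ara", "eng"])
print(" ".join(cmd))  # tesseract page.png stdout -l ara+eng
```

The resulting list can be handed to `subprocess.run` on a machine where tesseract and the matching traineddata files are installed.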

[tesseract-ocr] Multiple Languages

2018-10-15 Thread Mariam Hijazi
Does Tesseract support recognizing multiple languages in one document? And how would I do that? Regards.