[tesseract-ocr] Word by Word

2018-10-23 Thread MariamHi
can Tesseract segment image to words and detect each word separate and how can I do it by Tesseract API .NET Wrapper ? -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an em

Re: [tesseract-ocr] Multiple Languages

2018-10-21 Thread MariamHi
I have tried tessdata_fast, the result in English better but get worse in Arabic Result : SimplifiedArabic ?ع? ?ao oe " "? اموه ?جوجا ( ?Google? يستعد لاقتحام أدمغتنا ?الاحد 9 سبتمبر 2018 1 ?الاقتصادية" من الزياض" ?و ابه سب ?هل حدث وأن بحثت عن منتج معين عبر الإنترنت وتفاجأت باقتراحات عدي

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
-ocr@googlegroups.com] On Behalf Of MariamHi Sent: 16 October 2018 13:19 To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages Result: SimplifiedArabic 'جوجل 600916 " يستعد لاقتحام أدمغتثا الاحد 9 سبتمبر 20185 الاقتصادية" من الرياض" 0 هل

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
0658 855108160؛ أن الكثير من خدمات جوجل على أجهزة آيفون 10110106 وآندرويد 8001010 تخزّن بيانات مواقع المستخدمين» حتى وإن قاموا بإيقاف تشغيل خدمات تحديد الموقع الجغرافي بتغيير إعدادات الخصوصية المتوفرة في تلك الأجهزة. From: MariamHi Sent: Tuesday, October 16, 2018 3:17 PM To: tesseract-ocr@goog

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
: Adrian Owen Sent: Tuesday, October 16, 2018 1:23 PM To: tesseract-ocr@googlegroups.com Subject: RE: [tesseract-ocr] Multiple Languages try PageSegmentationMode.AUTO You may need to enlarge to 300, what’s original DPI? From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com]

RE: [tesseract-ocr] Multiple Languages

2018-10-16 Thread MariamHi
to replicate manual GIMP steps, that’s easy. I found greyscale didn’t help. YES: Long line removal (may not apply to you) (OpenCV) YES: resize to 300DPI YES: Apply filters Hope helps, Adrian From: tesseract-ocr@googlegroups.com [mailto:tesseract-ocr@googlegroups.com] On Behalf Of MariamHi Sent

RE: [tesseract-ocr] Multiple Languages

2018-10-15 Thread MariamHi
I did this but I have Bad recognition for English word .. what is the accuracy for multiple languages and how to improve it ? From: Adrian Owen Sent: Monday, October 15, 2018 3:35 PM To: tesseract-ocr Subject: Re: [tesseract-ocr] Multiple Languages Just list locales using + delimiter. Sent from