[tesseract-ocr] Re: why such simple word can't be recognized?

2018-08-16 Thread xllacyx
all scaling images 在 2018年8月17日星期五 UTC+8上午1:24:27,xll...@gmail.com写道: > > many thanks for your information. > I have tried a lot of scaling, from factor y1.0, x0.8 to y1.0, x0.5 , none > of them work. > I will try do some fine tune train with this font. > thanks again for let me known this font n

[tesseract-ocr] Re: why such simple word can't be recognized?

2018-08-16 Thread xllacyx
many thanks for your information. I have tried a lot of scaling, from factor y1.0, x0.8 to y1.0, x0.5 , none of them work. I will try do some fine tune train with this font. thanks again for let me known this font name :) 在 2018年8月15日星期三 UTC+8下午9:49:01,James Q写道: > > It looks like you may need t

[tesseract-ocr] Re: why such simple word can't be recognized?

2018-08-16 Thread xllacyx
yes, more funny thing is that if I feed all words to tesseract, it will successful read "ears" out, you can test with attachment . 在 2018年8月14日星期二 UTC+8下午8:41:06,zwwts...@gmail.com写道: > > It's interesting. I'v tried many way to process the img, binary inverse, > cut, resize. > I'v tried with oe

[tesseract-ocr] Re: Text2image Issues for Tamil Unicode Font Sundaram - 0807

2018-08-16 Thread Mugunthan
Hi Shree, Thanks for your reply. I've just tried this procedure using the same font in the Linux environment and it worked fine. I believe there should be some issue with Windows version text2image. I'm using ub mannheim version as suggested in github. -- You received this message because

[tesseract-ocr] Re: Text2image Issues for Tamil Unicode Font Sundaram - 0807

2018-08-16 Thread shree
>Ps: I've used other tamil unicode fonts such as Latha, Akshar and TheeneeUni, they all worked perfectly. As you recognize in statement above, problem is with the font not tesseract. On Tuesday, August 14, 2018 at 12:03:42 AM UTC+5:30, Mugunthan wrote: > > I've been making tif/box files for Tami

[tesseract-ocr] Recognize a bitmap is error

2018-08-16 Thread Nguyen Nhat Khanh
Hi All, I used the Tesseract 4.0 to extract text from a bitmap but i get a error when i call Recognize method. => Error : Assertion failed: index >= 0 && index < size_used_, file E:\Libs\Text\tesseract-master\src\ccutil\genericvector.h, line 720 Can you help me? My Code : tesseract::Te

Re: [tesseract-ocr] Make lstm for some files

2018-08-16 Thread Shree Devi Kumar
You need to make lstmf file for each of these. eg. tesseract fas.B_Mitra.exp0.tif fas.B_Mitra.exp0 --psm 6 lstm.train will create fas.B_Mitra.exp0.lstmf On Thu, Aug 16, 2018 at 5:40 PM, Zohreh Khosrobeygi wrote: > I have some tif and box files for each font for example: > fas.B_Mitra.exp

[tesseract-ocr] Make lstm for some files

2018-08-16 Thread Zohreh Khosrobeygi
I have some tif and box files for each font for example: fas.B_Mitra.exp0.box fas.B_Mitra.exp0.tif fas.B_Mitra.exp1.box fas.B_Mitra.exp1.tif fas.B_Mitra.exp2.box fas.B_Mitra.exp2.tif . . . How can I make lstm for each of them? Thx. -- You received this message because you are subscribed to the Go