Re: Tessarct for separating handwritten words

2012-04-25 Thread Mayur Mudigonda
Well, it appears to me that you are trying to extract words from a document/image. If Tesseract functions correctly on an image, it can definitely give you words, and you could possibly use that to break up the image. That said, I am not sure that Tesseract works well on handwritten words.

Re: Shift Enter between each element of a Excel grid

2012-04-25 Thread Mayur Mudigonda
Hmm. What page_seg_mode are you using? On Wed, Apr 25, 2012 at 3:17 AM, Pleiades wrote: > Just to be sure I was clear about what I have and what I want. > > In version 3.0 I had as output file what I am searching now in version > 3.01 (each \t corresponds to one box of my Excel array) > > Titre

Training tessnet2 with custom font

2012-04-25 Thread Sebastian Siatkowski
I find a ton of data on how to train the tesseract 3 software with new character sets by using the provided tools. However, I am wondering if I could accomplish this by using the C# API of tessnet2. I want tesseract to learn a custom font that is used in my company's software applications. I have t

Re: Tess4J 1.0 Beta Release

2012-04-25 Thread Quan Nguyen
Pix and other TessBaseAPI data structures, such as STRING, are not available to Java client code as Java objects. I am only able to use the types that the C wrapper exposes. Moreover, keeping external dependencies to a minimum is also a desirable design objective. I did put in a fix that has ma

Re: Tess4J 1.0 Beta Release

2012-04-25 Thread Quan Nguyen
I did test your image unmodified with the updated version and was able to get some result out of it: Settings ? Wireless & networks I‘ Call settings i’) Sound ?? Display E Location & security Applications Rn AFFFHIVYIT 2. €\H'\f' On Wednesday, April 25, 2012 3:39:07 AM UTC-5, harry asir wro

Re: Tessarct for separating handwritten words

2012-04-25 Thread zdenko podobny
On Wed, Apr 25, 2012 at 11:10 AM, Lucas Swartsenburg wrote: > Excuse me, the correct term would be: segmentation. So this sentence would > be segmented in: > > "So", "this", "sentence", "would", "be", "in". (all of these are images > of the handwritten words). > > > I am not sure if I got your po

Re: Windows newline?

2012-04-25 Thread Zdenko Podobný
Dňa 25.04.2012 14:06, Nonmaskable Interrupt wrote / napísal(a): > I just built 3.02 from svn using VS2008 and it seems to work fine, except > that newline characters > are Linux standard ('\n') instead of Windows ('\r\n') standard. This is a > change from previous behavior; > is it intentional?

Re: how to detect cusrsive char using ubuntu

2012-04-25 Thread Ashish Makwana
I am using the English language for Tesseract. On Wed, Apr 25, 2012 at 3:58 PM, Ankur Rana wrote: > which language script you want to use tesseract? > > On Wed, Apr 25, 2012 at 3:04 PM, Ashish Makwana > wrote: > > Respected Sir, > > > > I like to know you that i am using tesseract-ocr 2.04 .It is w

Windows newline?

2012-04-25 Thread Nonmaskable Interrupt
I just built 3.02 from svn using VS2008 and it seems to work fine, except that newline characters are Linux standard ('\n') instead of Windows standard ('\r\n'). This is a change from previous behavior; is it intentional?
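
As a workaround sketch for the newline difference described above, the text output can be post-processed to use Windows line endings. This is a minimal Python example, not part of Tesseract; the function name `to_crlf` is hypothetical, and it assumes the OCR text has already been read into a string.

```python
def to_crlf(text):
    """Convert all line endings in text to Windows style (CRLF).

    Existing CRLF pairs are collapsed to LF first so they are not doubled.
    """
    normalized = text.replace("\r\n", "\n")
    return normalized.replace("\n", "\r\n")


if __name__ == "__main__":
    sample = "first line\nsecond line\r\nthird line"
    # repr() makes the line-ending characters visible.
    print(repr(to_crlf(sample)))
```

When writing the result to a file in Python, open it with `newline=""` so the runtime does not translate the line endings a second time.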

Re: how to detect cusrsive char using ubuntu

2012-04-25 Thread Ankur Rana
Which language/script do you want to use Tesseract with? On Wed, Apr 25, 2012 at 3:04 PM, Ashish Makwana wrote: > Respected Sir, > > I like to know you that i am using tesseract-ocr 2.04 .It is working > for regular char but it is not work for cursive char. > so, please give me solution for that as early

Re: Shift Enter between each element of a Excel grid

2012-04-25 Thread Pleiades
Just to be sure I was clear about what I have and what I want. In version 3.0, the output file was what I am now looking for in version 3.01 (each \t corresponds to one cell of my Excel grid) Titre CLA ORI MET NUM D NUM F X (pm) Y (pm) SE CLR FOR RES (Excel picture: 12 columns f

Re: Tessarct for separating handwritten words

2012-04-25 Thread Lucas Swartsenburg
Excuse me, the correct term would be: segmentation. So this sentence would be segmented into: "So", "this", "sentence", "would", "be", "in". (all of these are images of the handwritten words). On Wednesday, April 25, 2012 5:41:57 AM UTC+2, Mayur Mudigonda wrote: > > what do you mean by separat

Re: Tessarct for separating handwritten words

2012-04-25 Thread Lucas Swartsenburg
Excuse me, I used the wrong term. The correct one is: word segmentation. So if I want to segment the sentence: I am Luuk. I get: "I", "am", "Luuk" (all of these will be small images of the handwritten words) On Wednesday, April 25, 2012 5:41:57 AM UTC+2, Mayur Mudigonda wrote: > > what do y
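
To illustrate the word segmentation described above, here is a minimal Python sketch using a vertical projection profile. This is a generic technique, not what Tesseract does internally. It assumes a single text line that has already been binarized into rows of 0/1 values (1 = ink); the function name `segment_words` and the `min_gap` threshold are hypothetical. Real handwriting usually needs more care, since gaps between words vary a lot.

```python
def segment_words(image, min_gap=2):
    """Return (start, end) column ranges of words in a binary line image.

    image   -- list of rows, each a list of 0/1 values (1 = ink)
    min_gap -- number of consecutive blank columns that separates two words
    The end column of each range is exclusive.
    """
    if not image:
        return []
    width = len(image[0])
    # Vertical projection: True for every column containing at least one ink pixel.
    has_ink = [any(row[x] for row in image) for x in range(width)]

    words = []
    start = None  # first column of the word currently being scanned
    gap = 0       # length of the current run of blank columns
    for x, ink in enumerate(has_ink):
        if ink:
            if start is None:
                start = x
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                # The word ended where the blank run began.
                words.append((start, x - gap + 1))
                start = None
                gap = 0
    if start is not None:
        # Close the last word, dropping any trailing blank columns.
        words.append((start, width - gap))
    return words


if __name__ == "__main__":
    line = [[1, 1, 0, 1, 0, 0, 0, 1, 1, 1, 0]]
    print(segment_words(line))
```

Each returned range can then be used to crop the corresponding word image out of the line.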

how to detect cusrsive char using ubuntu

2012-04-25 Thread Ashish Makwana
Respected Sir, I would like to let you know that I am using tesseract-ocr 2.04. It works for regular characters but does not work for cursive characters. Please give me a solution for that as early as possible.

Re: Tess4J 1.0 Beta Release

2012-04-25 Thread harry asir
Hi, the issue still exists in the latest beta version of Tess4J. If I convert the image to 300 DPI, can I get the coordinates of the text corresponding to the original image using hOCR on the converted 300 DPI image? If so, can you guide me on how to convert the image to 300 DPI using Java? Not
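
On the coordinate question above: if the converted image is a plain rescale of the original, a bounding box found on the converted image can be mapped back by multiplying by the ratio of the two sizes. The Python sketch below shows only that arithmetic. The function name `map_bbox_to_original` is hypothetical, the box is assumed to be in the `x0 y0 x1 y1` pixel order that hOCR `bbox` properties use, and no cropping or rotation is assumed between the two images.

```python
def map_bbox_to_original(bbox, scaled_size, original_size):
    """Map a bounding box from a rescaled image back to the original image.

    bbox          -- (x0, y0, x1, y1) in pixels on the rescaled image
    scaled_size   -- (width, height) of the rescaled image
    original_size -- (width, height) of the original image
    """
    x0, y0, x1, y1 = bbox
    sx = original_size[0] / scaled_size[0]  # horizontal scale factor
    sy = original_size[1] / scaled_size[1]  # vertical scale factor
    return (round(x0 * sx), round(y0 * sy), round(x1 * sx), round(y1 * sy))


if __name__ == "__main__":
    # A box on a 2000x1000 upscaled image, mapped back to the 1000x500 original.
    print(map_bbox_to_original((100, 200, 300, 400), (2000, 1000), (1000, 500)))
```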