Well, it appears to me that you are trying to extract words from a
document/image.
If tesseract functions correctly on an image, it can definitely give you
words and you could possibly use that to break up the image.
That said, I am not sure that Tesseract works great on handwritten words.
M
On
Hmm. What page_seg_mode are you using?
On Wed, Apr 25, 2012 at 3:17 AM, Pleiades wrote:
> Just to be sure I was clear about what I have and what I want.
>
> In version 3.0 I had as output file what I am searching now in version
> 3.01 (each \t corresponds to one box of my Excel array)
>
> Titre
I find a ton of data on how to train the tesseract 3 software with new
character sets by using the provided tools. However, I am wondering if
I could accomplish this by using the C# API of tessnet2. I want
tesseract to learn a custom font that is used in my company's software
applications. I have t
Pix and other TessBaseAPI data structures, such as STRING, are not
available to Java client code as Java objects. I am only able to use the
types that the C wrapper exposes. Moreover, keeping external dependencies
to a minimum is also a desirable design objective.
I did put in a fix that has ma
I did test your image unmodified with the updated version and was able to
get some result out of it:
Settings
? Wireless & networks
I‘ Call settings
i’) Sound
?? Display
E Location & security
Applications
Rn AFFFHIVYIT 2. €\H'\f'
On Wednesday, April 25, 2012 3:39:07 AM UTC-5, harry asir wro
On Wed, Apr 25, 2012 at 11:10 AM, Lucas Swartsenburg wrote:
> Excuse me, the correct term would be: segmentation. So this sentence would
> be segmented in:
>
> "So", "this", "sentence", "would", "be", "in". (all of these are images
> of the handwritten words).
>
>
> I am not sure if I got your po
Dn(a 25.04.2012 14:06, Nonmaskable Interrupt wrote / napísal(a):
> I just built 3.02 from svn using VS2008 and it seems to work fine, except
> that newline characters
> are Linux standard ('/n') instead of windows ('\r\n') standard. This is a
> change from previous behavior;
> is it intentional?
i am using english language for tesseract.
On Wed, Apr 25, 2012 at 3:58 PM, Ankur Rana wrote:
> which language script you want to use tesseract?
>
> On Wed, Apr 25, 2012 at 3:04 PM, Ashish Makwana
> wrote:
> > Respected Sir,
> >
> > I like to know you that i am using tesseract-ocr 2.04 .It is w
I just built 3.02 from svn using VS2008 and it seems to work fine, except
that newline characters
are Linux standard ('/n') instead of windows ('\r\n') standard. This is a
change from previous behavior;
is it intentional?
--
You received this message because you are subscribed to the Google
Grou
which language script you want to use tesseract?
On Wed, Apr 25, 2012 at 3:04 PM, Ashish Makwana wrote:
> Respected Sir,
>
> I like to know you that i am using tesseract-ocr 2.04 .It is working
> for regular char but it is not work for cursive char.
> so, please give me solution for that as early
Just to be sure I was clear about what I have and what I want.
In version 3.0 I had as output file what I am searching now in version
3.01 (each \t corresponds to one box of my Excel array)
Titre CLA ORI MET NUM D NUM F X (pm) Y (pm) SE CLR
FOR RES (Excel picture: 12 columns f
Excuse me, the correct term would be: segmentation. So this sentence would
be segmented in:
"So", "this", "sentence", "would", "be", "in". (all of these are images of
the handwritten words).
On Wednesday, April 25, 2012 5:41:57 AM UTC+2, Mayur Mudigonda wrote:
>
> what do you mean by separat
Excuse me, I used the wrong term. The correct one is: word segmentation. So
if I want to segment the sentence:
I am Luuk.
I get:
"I", "am", "Luuk" (all of these will be small images of the handwritten
words)
On Wednesday, April 25, 2012 5:41:57 AM UTC+2, Mayur Mudigonda wrote:
>
> what do y
Respected Sir,
I like to know you that i am using tesseract-ocr 2.04 .It is working
for regular char but it is not work for cursive char.
so, please give me solution for that as early as possible.
--
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
Hi,
Issue still exists in the latest beta version of Tess4J also.
If I convert the image to 300 DPI, whether i can get the coordinates
of the text corresponding to original image using hocr on the
converted 300 DPI image. If yes, can u guide me how to convert the
image to 300 DPI using Java.
Not
15 matches
Mail list logo