hi,
tested with tesseract 3.04 on windows:
i recently also tried a printed page which gave better results nearly no
errors.
here's the command and the output of tesseract:
$ tesseract c:\LockBits_0_0.tif c:\LockBits_0_0 -l eng
Tesseract Open Source OCR Engine v3.04.00 with Leptonica
cygwin warning:
MS-DOS style path detected: c:\LockBits_0_0.txt
Preferred POSIX equivalent is: /LockBits_0_0.txt
CYGWIN environment variable option "nodosfilewarning" turns off this
warning.
Consult the user's guide for more details about POSIX paths:
http://cygwin.com/cygwin-ug-net/using.html#using-pathnames
Page 1
Warning in pixReadMemTiff: tiff page 1 not found
tesseract just is a little bit pissed off about some tiff issues it seems.
additionally it didn't like my typing of paths which doesn't do all that
much to it so we just ignore the cygwin warning.
i got that result:
Paramems
rear [In]
Type mnsl Red‘
Pointer to a rectangle that speotfies the pomon of the hnmap to he looked.
flags finl
Type um
Set offlzgs that specify whether the |od<2d pomon of the hnmap rs
avarlahle for reading or lor wrmllg
and whether the caller has already allouted a butter. lndlvlduzl llags
are defined in the lmagelucande
enumeration.
flmntn [ln]
Type Pixeanlmat
integer that specrfies the fumial of the pixel data In the lempurary
bulfer. The plxel tonnat ulthe
temporary hurler does not have in he the same as the plxel tonnat ulthrs
nitmap ubjen. The
pixelronnat data type and constants that represent various pixel lormats
are defined in
adiplusprxellormatsh. For more mtonnahon about pixel furmal cnns'ams see
Image Pixel Forrnat
Constants CD“ version 1.0 does run support processing of
lésbityperrchannel images so yuu should
run set this parameter equal to PixelFormaMBbppRGB, PlxelFormachppARGfi, ur
PlxeanmlathppPARGB.
lackedElfmapDaf-I fin. out]
Type BixmapData‘
Pointer to a BilmapData object If the lrnageLockModeUserlnputam flag
ulthe flagx parameter is cleared
then IodredBltmupDam serves only as an output parameter. in that use.
the 5am data rnemherotthe
aiunapneta otnect reoerves a porrner to a temporary putter, which is
filled with the values otthe
requested plxels. The other data members at the aiunapueta otnect
rederve attrihutes (wldlh. height
lormat and stride) of the plxel data m the temporary tamer. If the plxel
data is stored hunamup. the
snide data memtrer is negative. it the pixel data is stored mpsdown, the
snide data memtrer is positive.
If the lmageLodtModeUserlnputaul flag at the flags parameter is set then
lodredliixmapbam serves as an
Am 24.10.2014 um 20:25 schrieb BDristan:
I'm quite new to tesseract. I just tried to OCR an image as follows:
tesseract LockBits.tif LockBits -l eng
The output text was pretty messed up. I ran tesseract 3.02 on Win7.
I then run an on-line OCR and got a perfect result.
Could someone please give me some hints on how to improve OCR with
tesseract.
Attached is an image file that I used.
Thanks.
--
Simon Eigeldinger
Follow me on Twitter: http://www.twitter.com/domasofan/
E-Mail: simon.eigeldin...@vol.at
MSN: simon_eigeldin...@hotmail.com
ICQ: 121823966
Jabber: domaso...@andrelouis.com
---
Diese E-Mail ist frei von Viren und Malware, denn der avast! Antivirus Schutz
ist aktiv.
http://www.avast.com
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/544ABC4E.3090403%40vol.at.
For more options, visit https://groups.google.com/d/optout.