Per instructions on web I ran:
gswin32 -r72x72 -sDEVICE=tiff12nc -sOutputFile=ocr_%02d.tif -dBATCH -
dNOPAUSE c:/php-mode.pdf
in order to ocr this pdf as a test.
First I used 300x300 resolution but that produced 12 meg tiffs, one
for each of 33 pages in pdf.
Both runs produced the following er
After installing tesseract-ocr 3.0 successfully and running it
against 3 or 4 pdfs, I now get the following error
C:\tesseract\Tesseract-OCR>tesseract ocr_107.tif beglat
Error openning data file C:\Program Files\Tesseract-OCR\tessdata/
eng.traineddata
A dir on ...\tessdata shows:
10/03/2010 0
ata file C:\Program Files\Tesseract-OCR\tessdata/
eng.traineddata
Well behaved w32 apps like emacs and gnuw32 utilities don't tell
Windows about themselves, why does tesseract have to?
On Apr 12, 6:59 pm, caudex wrote:
> After installing tesseract-ocr 3.0 successfully and running it
>
test
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups.google
only a test
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to tesseract-ocr+unsubscr...@googlegroups.com.
To view this discussion on the web visit
https://groups
Can't Ghostscript do all that?
E.g. (for pdf):
gs [or] gswin32c -r300 -sDEVICE=tiffgray -sOutputFile=ocr_%02d.tif
-dBATCH -dNOPAUSE sample.pdf
This produces a series of 8 bit per pixel tif files at 300 dpi.
Graham Chiu wrote:
> You'll probably have to also install imagemagick to convert ima
At code.google.com it says that the 1.9 meg zip file is deprecated. Is
there a more up to date one anywhere? If I just get the .exe file will
it work with the old language files. I am playing with version 2.04 now.
Thanks,
Ed
--
You received this message because you are subscribed to the Googl
zdenko podobny wrote:
> On Wed, Apr 13, 2011 at 2:31 AM, caudex wrote:
>
>> After using regedit and pointing tessdata_prefix to the right place
>> and running again I got an error that referred to unicharset. The
>> entire contents of my tessdata subdirectory is:
>>
8 matches
Mail list logo