blank tiff files generated by ghostscript

2009-08-18 Thread caudex
Per instructions on web I ran: gswin32 -r72x72 -sDEVICE=tiff12nc -sOutputFile=ocr_%02d.tif -dBATCH - dNOPAUSE c:/php-mode.pdf in order to ocr this pdf as a test. First I used 300x300 resolution but that produced 12 meg tiffs, one for each of 33 pages in pdf. Both runs produced the following er

Problem with eng.traineddata after 3 or 4 successful runs against different pdf's

2011-04-12 Thread caudex
After installing tesseract-ocr 3.0 successfully and running it against 3 or 4 pdfs, I now get the following error C:\tesseract\Tesseract-OCR>tesseract ocr_107.tif beglat Error openning data file C:\Program Files\Tesseract-OCR\tessdata/ eng.traineddata A dir on ...\tessdata shows: 10/03/2010 0

Re: Problem with eng.traineddata after 3 or 4 successful runs against different pdf's

2011-04-12 Thread caudex
ata file C:\Program Files\Tesseract-OCR\tessdata/ eng.traineddata Well behaved w32 apps like emacs and gnuw32 utilities don't tell Windows about themselves, why does tesseract have to? On Apr 12, 6:59 pm, caudex wrote: > After installing tesseract-ocr 3.0 successfully and running it >

[tesseract-ocr] test

2023-11-07 Thread caudex
test -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google

[tesseract-ocr] off topic.... is this group moderated, can it be indexed by eternal september?

2023-11-29 Thread caudex
only a test -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to tesseract-ocr+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups

Re: Any Tesseract installers for hire?

2009-09-01 Thread E. Caudex
Can't Ghostscript do all that? E.g. (for pdf): gs [or] gswin32c -r300 -sDEVICE=tiffgray -sOutputFile=ocr_%02d.tif -dBATCH -dNOPAUSE sample.pdf This produces a series of 8 bit per pixel tif files at 300 dpi. Graham Chiu wrote: > You'll probably have to also install imagemagick to convert ima

Where can I find latest (3.0??) w32 binary of tesseract

2011-04-07 Thread E. Caudex
At code.google.com it says that the 1.9 meg zip file is deprecated. Is there a more up to date one anywhere? If I just get the .exe file will it work with the old language files. I am playing with version 2.04 now. Thanks, Ed -- You received this message because you are subscribed to the Googl

Re: Problem with eng.traineddata after 3 or 4 successful runs against different pdf's

2011-04-14 Thread E. Caudex
zdenko podobny wrote: > On Wed, Apr 13, 2011 at 2:31 AM, caudex wrote: > >> After using regedit and pointing tessdata_prefix to the right place >> and running again I got an error that referred to unicharset. The >> entire contents of my tessdata subdirectory is: >>