from:"Larry Kraemer via cctalk"

Re: OCR old software listing

2019-01-02 Thread Larry Kraemer via cctalk

The only way I've been able to get any type of readable ASCII TEXT from the .tif's is to do the following for each tif: convert -density 1200 -resize 40% xaaa.tif -density 1 xaaa120040.tif Then, OCR it with Irfanview with the KADMOS Plugin Installed. For the first Page I get the following ASCII:

Re: OCR old software listing

2018-12-31 Thread Larry Kraemer via cctalk

I used the libtiff-tools (Debian 8.x - 32 Bit) to extract all 61 .TIF's from the Multipage .tif file. While the .tif's look descent, and RasterVect shows the .tif properties to be Group 4 Fax (1bpp) with 5100 x 6600 pixels - 300 DPI, I can't get tesseract 3.x, TextBridge Classic 2.0, or Irfanview