Here is another command sequence that also worked well on a PDF file containing an image:

gs -sOutputFile=hugo -sDEVICE=pnmraw -dNOPAUSE -dBATCH -r600x600 hugo.pdf
cat hugo | pamthreshold -simple -threshold 0.5 | pamtopnm | ocrad --format=utf8

This could serve as the basis for another preprocessor and scanset in FuzzyOcr.
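
For experimenting outside FuzzyOcr, the two commands could be wrapped in a small shell function. This is only a sketch: the name pdf_ocr and the temporary-file handling are my own assumptions, not part of FuzzyOcr, and I have only had single-page input in mind. The gs, pamthreshold, pamtopnm and ocrad options are the ones shown above, plus -q to keep gs quiet.

```shell
#!/bin/sh
# Sketch: rasterize a PDF with Ghostscript, binarize it, and OCR it with ocrad.
# pdf_ocr is a hypothetical helper name, not something FuzzyOcr provides.
pdf_ocr() {
    if [ "$#" -ne 1 ]; then
        echo "usage: pdf_ocr file.pdf" >&2
        return 2
    fi
    pdf=$1
    if [ ! -r "$pdf" ]; then
        echo "pdf_ocr: cannot read $pdf" >&2
        return 1
    fi
    # Temporary file instead of the fixed name "hugo" (an assumption for safety).
    tmp=$(mktemp) || return 1
    # Render at 600 dpi to raw PNM, then threshold at 0.5 and pass the result to ocrad.
    gs -q -sOutputFile="$tmp" -sDEVICE=pnmraw -dNOPAUSE -dBATCH -r600x600 "$pdf" &&
        pamthreshold -simple -threshold 0.5 < "$tmp" | pamtopnm | ocrad --format=utf8
    status=$?
    rm -f "$tmp"
    return "$status"
}
```

After sourcing the file, "pdf_ocr hugo.pdf" should print the recognized text on standard output, provided Ghostscript, Netpbm and ocrad are installed.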

Just some ideas...

Claude
