Jorge Valdes wrote:
> There are multiple images in these gifs, and because the first image
> is 'junk', sending this image through gocr will yield no results. The
> problem is that you have to scan all images to find the text. Try
> this with each image:
>
> convert -append News.gif pnm:- | gocr -

That works a treat...

> I have an updated version of the FuzzyOcr plugin that has this and
> other improvements available here:
>
> http://www.joval.info/proj/FuzzyOcr.html

Version 2.3j works much better... I'd previously been using version 2.3b, for which I had an ebuild for Gentoo.

One thing I have noticed, however, is a number of errors/warnings which spamd sticks into /var/log/messages when it is started:

--
Sep 26 17:20:48 server spamd[25563]: Subroutine new redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 122.
Sep 26 17:20:48 server spamd[25563]: Subroutine parse_config redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 132.
Sep 26 17:20:49 server spamd[25563]: Subroutine finish_parsing_end redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 184.
Sep 26 17:20:49 server spamd[25563]: Subroutine dummy_check redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 288.
Sep 26 17:20:49 server spamd[25563]: Subroutine load_global_words redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 292.
Sep 26 17:20:49 server spamd[25563]: Subroutine load_personal_words redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 315.
Sep 26 17:20:49 server spamd[25563]: Subroutine max redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 343.
Sep 26 17:20:49 server spamd[25563]: Subroutine within_threshold redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 351.
Sep 26 17:20:49 server spamd[25563]: Subroutine fmt_time redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 388.
Sep 26 17:20:49 server spamd[25563]: Subroutine check_image_hash_db redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 414.
Sep 26 17:20:49 server spamd[25563]: Subroutine add_image_hash_db redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 492.
Sep 26 17:20:49 server spamd[25563]: Subroutine calc_image_hash redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 539.
Sep 26 17:20:49 server spamd[25563]: Subroutine debuglog redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 580.
Sep 26 17:20:49 server spamd[25563]: Subroutine wrong_ctype redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 590.
Sep 26 17:20:49 server spamd[25563]: Subroutine corrupt_img redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 608.
Sep 26 17:20:49 server spamd[25563]: Subroutine known_img_hash redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 626.
Sep 26 17:20:49 server spamd[25563]: Subroutine removedir redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 637.
Sep 26 17:20:49 server spamd[25563]: Subroutine fuzzyocr_check redefined at /etc/mail/spamassassin/FuzzyOcr.pm line 657.
--

Have I somehow loaded this module twice? I didn't get these messages until I upgraded to version 2.3j from 2.3b....
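
One way to test the "loaded twice" guess might be to look for more than one active `loadplugin` line naming FuzzyOcr in the SpamAssassin config directory, for instance one left behind by the old 2.3b ebuild and one added for 2.3j. The following is only a rough sketch: the function name `find_fuzzyocr_loads` is made up for illustration, and it assumes the plugin is enabled through `loadplugin` lines in `*.cf` or `*.pre` files under /etc/mail/spamassassin. It does not prove that duplicate lines are what causes the warnings.

```python
import re
from pathlib import Path

# Matches an active (not commented-out) loadplugin directive that names FuzzyOcr.
LOADPLUGIN = re.compile(r"^\s*loadplugin\s+\S*FuzzyOcr\b", re.IGNORECASE)


def find_fuzzyocr_loads(config_dir):
    """Return (file name, line number, text) for each loadplugin line naming FuzzyOcr.

    Hypothetical helper: only *.cf and *.pre files directly inside
    config_dir are scanned.
    """
    hits = []
    for path in sorted(Path(config_dir).iterdir()):
        if not path.is_file() or path.suffix not in (".cf", ".pre"):
            continue
        text = path.read_text(errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            if LOADPLUGIN.match(line):
                hits.append((path.name, lineno, line.strip()))
    return hits


if __name__ == "__main__":
    # Assumed location, taken from the paths in the log output above.
    for name, lineno, line in find_fuzzyocr_loads("/etc/mail/spamassassin"):
        print(f"{name}:{lineno}: {line}")
```

If this prints more than one line, the extra `loadplugin` entry would be the first thing to comment out before restarting spamd.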