-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 decoder wrote: > Hello there, > > I have improved the original OcrPlugin (found at > http://wiki.apache.org/spamassassin/OcrPlugin), so it contains > fuzzy matching. Like that, mistakes made by the OCR recognition or > intentional obfuscations in the text don't make the recognition > impossible. This is being done with a relative distance calculation > between the pattern (word from a given word list) and a line in > the recognized input. Also, the plugin uses dynamic scoring (more > matched words means more score, this can be adjusted in the > source). > > You can find a full description and an example in the wiki under: > > http://wiki.apache.org/spamassassin/FuzzyOcrPlugin > > > Ideas for improvements or critics are always welcome :) > > > Best regards, > > > Chris
Hello there, I've just released version 2.1c, which fixes problems when using Spamassassin + Mailscanner (score is always 1.0). Thanks for this bug report and patch to Howard Kash. Other (minor) changes: - -Fixed a typo (treshold -> threshold), if you are using this variable in your config, you need to fix this. - -Removed the '-' from jpegtopnm arguments to provide backwards compatiblity to older netpbm (as someone else mentioned here before) The updated version can be found at the usual download URL (see the spamassassin wiki under FuzzyOcr) Best regards Christian -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFE33TcJQIKXnJyDxURAukgAKCYIPpk1R0oHQH7qdCVtrd7DdHGowCfVsZh 3KUFvNC5v52BytjKnA2OooY= =0r9I -----END PGP SIGNATURE-----