Thanks for the replies.  I now have a working pair of post-SpamAssassin
procmail filters that drop messages with duplicate Message-IDs or
near-duplicate message bodies:

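    # Only act on messages SpamAssassin has flagged as spam.
    :0
    * ^X-Spam-Status: Yes
    {
        # Drop the message if its Message-ID is already in the cache
        # (formail -D exits 0 on a duplicate, which procmail takes as delivered).
        :0 Wh: msgid.lock
        | $FORMAIL -D 16384 $HOME/tmp/msgid.cache

        # Likewise drop it if pycksum has seen a similar body before.
        :0 W: cksum.lock
        | $PYCKSUM $HOME/tmp/cksum.cache

        # Everything else flagged as spam goes to the spam folder.
        :0:
        $SPAM
    }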

pycksum is a little Python script I whipped up that works more-or-less
like formail -D, except that it suitably abstracts the message body and
then uses the md5 checksum of the result as the basis for the equality
test.  If anyone's interested in it (I'm sure the many Perl mongers among
you already have something similar), let me know.  I'd be happy to send
you a copy.
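
For the curious, here is a rough sketch of the idea.  It is not the actual
pycksum script, just an illustration under my own assumptions: the function
names (body_digest, seen_before), the normalization rules (lowercase, strip
digits, collapse whitespace), the one-digest-per-line cache format and the
1000-entry limit are all made up for this example.

```python
"""Illustrative stand-in for a pycksum-like filter (not the original script)."""
import email
import hashlib
import re
import sys


def body_digest(raw: bytes) -> str:
    """Return the md5 hex digest of a normalized ("abstracted") body."""
    msg = email.message_from_bytes(raw)
    chunks = []
    for part in msg.walk():
        if part.is_multipart():
            continue  # containers carry no text of their own
        payload = part.get_payload(decode=True) or b""
        chunks.append(payload.decode("latin-1"))
    text = "\n".join(chunks).lower()
    # Assumed normalization: remove digits (prices, serial numbers) and
    # collapse runs of whitespace so trivial variations hash the same.
    text = re.sub(r"\d+", "", text)
    text = re.sub(r"\s+", " ", text).strip()
    return hashlib.md5(text.encode("utf-8")).hexdigest()


def seen_before(digest: str, cache_path: str, max_entries: int = 1000) -> bool:
    """Return True if digest is in the cache; otherwise record it."""
    try:
        with open(cache_path) as fh:
            entries = fh.read().split()
    except FileNotFoundError:
        entries = []
    if digest in entries:
        return True
    entries.append(digest)
    # Keep only the newest entries so the cache stays bounded.
    with open(cache_path, "w") as fh:
        fh.write("\n".join(entries[-max_entries:]) + "\n")
    return False


def main(argv) -> int:
    """Exit status 0 means "duplicate", mirroring formail -D."""
    if len(argv) != 2:
        sys.stderr.write("usage: pycksum CACHEFILE < message\n")
        return 2
    digest = body_digest(sys.stdin.buffer.read())
    return 0 if seen_before(digest, argv[1]) else 1
```

To use something like this from the recipe above, the script would end with
sys.exit(main(sys.argv)).  Exit status 0 then tells procmail (because of the
W flag) that the message has been handled, so it is dropped; any other status
lets it fall through to the next recipe.  The local lockfile on the recipe is
what keeps two deliveries from updating the cache at the same time.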

-- 
Skip Montanaro ([EMAIL PROTECTED] - http://www.mojam.com/)
"Excellant Written and Communications Skills required" - seen on chi.jobs

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
