Hi all,

After goggling and digging a little bit into mailing list archives, and of course with the help of this mailing list, i think we've managed to get what we want, and here goes just in case someones needs something similar.

Our system :
FreeBSD (4.8) + Qmail-Scanner (1.20rc4) + spamd+spamc (2.60)

Our problem :
Feed all the spam and ham we have into sa-learn, but removing the
headers added by qmail-scanner, since normal headers from Spamassassin
are recognized and ignored.

Headers like :

Received: from [EMAIL PROTECTED] by mail.host-services.com by uid 101 with qmail-scanner-1.20rc4
(spamassassin: 2.60. Clear:RC:0:SA:1(11.3/8.0):.
Processed in 0.328309 secs); 19 Nov 2003 23:34:45 -0000




Our solution :
Instead of using "bayes_ignore_header Received" and loose all the Received headers, we created this small shell script :


#!/bin/csh

mkdir clean-headers

foreach file (*)
echo "Removing headers from : "$file
if (-f $file) then
cat $file | egrep -vi "(spamassassin|processed|qmail-scanner)" > clean-headers/$file
endif
end


This script just creates a directory called clean-headers and copies all the email without the headers included by qmail-scanner.
Now we have a directory that can be feed into sa-learn, not worrying that sa-learn might learn something he shouldn't.


Comments are welcome as always.

Thank you.
Carlos Santos (Portugal)



-------------------------------------------------------
This SF.net email is sponsored by: SF.net Giveback Program.
Does SourceForge.net help you be more productive?  Does it
help you create better code?  SHARE THE LOVE, and help us help
YOU!  Click Here: http://sourceforge.net/donate/
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to