After googling and digging a little through the mailing list archives, and of course with the help of this mailing list, I think we've managed to get what we want. Here it is, in case someone needs something similar.
Our system : FreeBSD (4.8) + Qmail-Scanner (1.20rc4) + spamd+spamc (2.60)
Our problem : Feed all the spam and ham we have into sa-learn, but first remove the headers added by qmail-scanner. The normal headers from SpamAssassin are recognized and ignored by sa-learn; the qmail-scanner ones are not.
Headers like :
Received: from [EMAIL PROTECTED] by mail.host-services.com by uid 101 with qmail-scanner-1.20rc4
    (spamassassin: 2.60. Clear:RC:0:SA:1(11.3/8.0):.
    Processed in 0.328309 secs); 19 Nov 2003 23:34:45 -0000
Our solution :
Instead of using "bayes_ignore_header Received" and losing all the Received headers, we created this small shell script :
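#!/bin/csh
# Copy every message in the current directory into clean-headers/,
# dropping the lines that qmail-scanner added.
mkdir -p clean-headers
foreach file (*)
    # Skip directories (including clean-headers itself).
    if (-f "$file") then
        echo "Removing headers from : $file"
        egrep -vi "(spamassassin|processed|qmail-scanner)" "$file" > "clean-headers/$file"
    endif
end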
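This script just creates a directory called clean-headers and copies every message into it, minus the lines added by qmail-scanner. Note that egrep filters every line, so any other header or body line containing "spamassassin", "processed" or "qmail-scanner" is dropped as well.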
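If losing matching body lines is a concern, a stricter variant could filter only the header block. The following is a rough sketch for /bin/sh with awk (not csh, and not what we actually ran). It assumes the headers end at the first blank line and that folded header lines begin with whitespace, as in a raw message. The function name strip_scanner_headers is made up for this example.

```shell
#!/bin/sh
# Sketch: drop Received headers that mention qmail-scanner, looking only at
# the header block. strip_scanner_headers is a hypothetical helper name.
strip_scanner_headers() {
    awk '
        function emit() {
            # Print the buffered header unless it is a qmail-scanner Received header.
            if (held != "" && !(tolower(held) ~ /^received:/ && tolower(held) ~ /qmail-scanner/))
                printf "%s", held
            held = ""
        }
        inbody   { print; next }                     # body: copy untouched
        /^$/     { emit(); inbody = 1; print; next } # first blank line ends the headers
        /^[ \t]/ { held = held $0 "\n"; next }       # folded continuation of the current header
                 { emit(); held = $0 "\n" }          # a new header starts
        END      { emit() }
    ' "$1"
}
```

It would be called once per message, for example: strip_scanner_headers somefile > clean-headers/somefile (somefile being a placeholder name).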
Now we have a directory that can be fed into sa-learn without worrying that sa-learn might learn something it shouldn't.
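Before feeding the directory to sa-learn, it may be worth checking how many lines were removed from each message. The qmail-scanner header shown above spans three lines, so a message that lost noticeably more than that probably lost some body text too. A small sketch for /bin/sh, run from the directory holding the original messages (count_removed_lines is a made-up name):

```shell
#!/bin/sh
# Sketch: print "<file> <number of lines removed>" for every message that has
# a cleaned copy in clean-headers/. count_removed_lines is a hypothetical helper.
count_removed_lines() {
    for f in *; do
        # Only compare regular files that exist in both places.
        [ -f "$f" ] && [ -f "clean-headers/$f" ] || continue
        before=$(wc -l < "$f")
        after=$(wc -l < "clean-headers/$f")
        echo "$f $((before - after))"
    done
}
```

Messages with an unexpectedly high count can then be inspected by hand with diff.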
Comments are welcome as always.
Thank you. Carlos Santos (Portugal)
_______________________________________________
Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk