At Sun Dec 28 17:55:46 2003, Ricardo Kleemann wrote:
>
> I don't know how this message got copied multiple times to
> the list, I certainly only sent it once... I'm sorry for the
> inconvenience...

Sometimes SourceForge's mail servers do misbehave like this.

> > > Anyway, is there an address I can send a tarball with a
> > > bunch of these messages? I keep getting spam which SA
> > > (using 2.60) consistently scores
> >
> > You could put the tarball on the web somewhere and mail
> > the URL to the list.
>
> www.americasnet.com/spam_samples/spam_samples.tgz
>
> If anyone would be kind enough to take a look and give me
> some pointers on how I could improve my SA configuration to
> catch these messages, I'd greatly appreciate it. I get a few
> of these every day, they're consistently scored low, and
> running them through sa-learn doesn't seem to do much.

There's not a great deal you can do with these, I don't think.

The majority of these messages are in multipart/alternative form,
with a huge text/plain part containing what look to be extracts of
books, and a text/html part containing the spam. A couple of the
messages repeat the book extract at the bottom of the text/html
section, using HTML/CSS tricks to render it (virtually) invisible.

There's an enormous amount of Bayes-busting material in these, so
running them through sa-learn is not going to help much.

The only thing I can think of that might help in future versions of
SA would be to detect:

  (a) multipart/alternative messages where the text/plain content is
      substantively different from the text/html content (though
      this is probably impossible);

  (b) multipart/alternative messages where the text/plain part is
      bigger than the text/html part (based on the assumption that,
      where the two parts are substantively the same, the HTML part
      will be larger due to its markup).
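
To make heuristic (b) concrete, here is a rough sketch in Python using
only the standard library's email package. It is not SpamAssassin code
and not how SA implements anything; the function name
plain_outweighs_html is made up for illustration, and comparing raw
decoded byte counts is just one assumed way of measuring "bigger".

```python
from email import message_from_string
from email.message import Message


def plain_outweighs_html(msg: Message) -> bool:
    """Heuristic (b): True if some multipart/alternative container holds
    more text/plain bytes than text/html bytes.

    Hypothetical helper for illustration only; not part of SpamAssassin.
    """
    for container in msg.walk():
        if container.get_content_type() != "multipart/alternative":
            continue
        if not container.is_multipart():
            # Declared multipart but has no usable sub-parts; skip it.
            continue

        sizes = {"text/plain": 0, "text/html": 0}
        for part in container.get_payload():
            ctype = part.get_content_type()
            if ctype in sizes:
                # decode=True undoes base64/quoted-printable, so the
                # comparison is on content rather than on its encoding.
                payload = part.get_payload(decode=True) or b""
                sizes[ctype] += len(payload)

        # The comparison only means something when both forms exist.
        if sizes["text/plain"] and sizes["text/html"]:
            if sizes["text/plain"] > sizes["text/html"]:
                return True
    return False
```

You would feed it a parsed message, e.g.
plain_outweighs_html(message_from_string(raw_text)). A legitimate
message with a terse HTML part could also trip this, so it would suit a
low-scoring rule rather than a verdict on its own.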

Martin
--
Martin Radford              |   "Only wimps use tape backup: _real_
[EMAIL PROTECTED]           | men just upload their important stuff  -o)
Registered Linux user #9257 |  on ftp and let the rest of the world  /\\
- see http://counter.li.org |       mirror it ;)"  - Linus Torvalds  _\_V