At Sun Dec 28 17:55:46 2003, Ricardo Kleemann wrote:
> 
> I don't know how this message got copied multiple times to
> the list, I certainly only sent it once... I'm sorry for the
> inconvenience...

Sometime sourceforge's mail servers do misbehave like this.

> > > Anyway, is there an address I can send a tarbal with a
> > > bunch of these messages? I keep getting spam which SA
> > (using 2.60) consistently scores
> > 
> > You could put the tarball on the web somewhere and mail
> > the URL to the list. 
>
> www.americasnet.com/spam_samples/spam_samples.tgz
> 
> If anyone would be kind enough to take a look and give me
> some pointers on how I could improve my SA configuration to
> catch these messages, I'd greatly appreciate it. I get a few
> of these every day, they're consistently scored low, and
> running them through sa-learn doesn't seem to do much.

There's not a great deal you can do with these, I don't think.

The majority of these messages are in multipart/alternative form with
a huge text/plain part containing what look to be extracts of books,
and a text/html part containing the spam.

A couple of the messages repeat the book-extract at the bottom of the
text/html section using HTML/CSS tricks to render it (virtually)
invisible. 

There's an enormous amount of bayes-busting material in these, so
running them through sa-learn is not going to help much.

The only thing I can think of that might help in future versions of SA
would be to detect (a) multipart/alternative messages where the
text/plain content is substatively different from the text/html
component (though this is probably impossible), (b)
multipart/alternative messages where the text/plain part is bigger than
the text/html part (based on the assumption that where the two parts
are substantively the same, the html part will be larger due to its
markup).

Martin
-- 
Martin Radford              |   "Only wimps use tape backup: _real_ 
[EMAIL PROTECTED] | men just upload their important stuff  -o)
Registered Linux user #9257 |  on ftp and let the rest of the world  /\\
- see http://counter.li.org |       mirror it ;)"  - Linus Torvalds _\_V


-------------------------------------------------------
This SF.net email is sponsored by: IBM Linux Tutorials.
Become an expert in LINUX or just sharpen your skills.  Sign up for IBM's
Free Linux Tutorials.  Learn everything from the bash shell to sys admin.
Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to