distill <[EMAIL PROTECTED]> writes:

> I've been receiving these "msnbc.com - BREAKING NEWS" spams recently. I've
> made sure that all of those spams (over 40 of them) are manually trained to
> be spam. SpamAssassin does filter out those messages about 75% of the time.
> However, even after this careful manual training some of those spams are
> still getting through (my score threshold is now 4.4). I get the feeling
> that the training doesn't have any effect. Is there something wrong or is
> SpamAssassin just incapable of learning this? The msnbc spams are almost
> identical to eachother with lots of words, so I would imagine this should be
> an easy task.

On your message I also got:

        *  3.0 URIBL_BLACK Contains an URL listed in the URIBL blacklist
        *      [URIs: planetahd.com]
        *  1.9 URIBL_AB_SURBL Contains an URL listed in the AB SURBL blocklist
        *      [URIs: planetahd.com]

but not the received ones of course.  You didn't get this, but I see
that on uribl planetahd.com was listed at 0826Z today.

> Also I'd like to ask about the RCVD_IN-tags: Is it possible/probable that if
> there are more than one of those tags present, for example
> "RCVD_IN_BL_SPAMCOP_NET,RCVD_IN_SORBS_WEB", that the message in fact could
> still be ham?

I'm not sure what you're asking.  If you mean

  "If I get a message where both RCVD_IN_BL_SPAMCOP_NET and
  RCVD_IN_SORBS_WEB fire, is there any chance the message is still ham?"

I'd say yes.  Those blacklists probably have overlapping listing
critieria, and certainly two lists listing something is at least a bit
stronger than one, but not absolute.

I have edited my scores file to increase MIME_HTML_ONLY and if I were
you would increase HTML_TAG_BALANCE_BODY as well (probably 1 point
each), unless you find lots of ham hits on these.

Reply via email to