Re: [SAtalk] Phony Habeas mark on spam...I knew it was just a matter of time

David A. Carter Mon, 12 Jan 2004 07:40:02 -0800

Quoting Theo Van Dinter <[EMAIL PROTECTED]>:

> If you learn a message that has already been learned as the opposite
> type,
> SA will auto-forget it before learning the way you specify. :)


I understand this, always have. This is what I usually do. I only did this
is because I wanted to learn over a different message than the one that was
auto-learned (different because I removed the habeas mark). 

> 
> As for removing the Habeas marks ...   I wouldn't do that, since they
> are,
> in your case, an indication of spam.

It's either a good indication of spam, or it is a good indication of ham.
Either way, i don't think autolearn should take it into consideration (more
on this later).

> 
> It's default if you enable network checks (it's an RBL). :)

Ah, ok. I usually set up any RBL checks at sendmail and not at SA. I
couldn't find any info about the RBL on habeas's site...is it in a different
RBL somewhere? How can I find out more about it? I already block using SBL
and XBL...will those catch it?

> Well, this is less a question of "should it be autolearned" and more
> of a "how good is the Habeas system"...  In the perfect world, it's
> not forgable/misused and you would always accept it as a sign of ham,
> and therefore autolearning is desired.
> 

Agreed. However, it seems we should at least assume spammers will try to
attack the Habeas model, and act accordingly.

> Since we don't live in the perfect world, the question is: can the
> Habeas folks act fast/complete enough so that forging/misusing the mark
> is completely minimized?  If they can, then there's not a huge issue --
> yeah, some spam will get through, but they'll quickly be squashed and
> there you go.  If they can't, then their whole business plan fails as
> people start ignoring the mark, and again no problem since the SA rules
> would go away.
>

I agree with all this, but what I'm worried about is my bayes database being
posioned in the meantime. This has already happened with the [EMAIL PROTECTED]@ or
whatever emails, they have been auto-learned as ham 15 times since we last
talked--i'll bet you anything the guy could send one now without the Habeas
mark and it will get thru on bayes scoring alone. I've finally got things
straightened out I think, and I've had to disable the habeas mark checks in
my local.cf by scoring them at 0.

Consider: With the current scoring, If an email has a habeas mark on it, it
doesn't really need to be added to the bayes database as the habeas mark
will always pull down the score low enough to mark it as ham (except for the
most extreme cases). So we don't really need to add those particular
messages to the ham database anyways, as good as ham examples as they may
be. On the other hand, if spammers try to abuse the habeas mark by sending
out buckets of spammy emails with a forged mark, even after habeas fixes the
problems, we're going to be seeing FNs forever more because now all that
spam looks like ham to bayes. I'm just saying, the auto-learn for
habeas-marked emails should be thought about...I don't see really any
downside to removing it (or making it user configurable), and I see a lot of
upside.

Regards;

DaC



-------------------------------------------------------
This SF.net email is sponsored by: Perforce Software.
Perforce is the Fast Software Configuration Management System offering
advanced branching capabilities and atomic changes on 50+ platforms.
Free Eval! http://www.perforce.com/perforce/loadprog.html
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Re: [SAtalk] Phony Habeas mark on spam...I knew it was just a matter of time

Reply via email to