On 4/1/2015 12:41 PM, Amir Caspi wrote:
Going back to this:

On Apr 1, 2015, at 7:47 AM, Bowie Bailey <bowie_bai...@buc.com> wrote:

That might be reasonable for most email addresses, but there are quite a few 
people who have a usable name or nickname as the user part of their email.  
(j...@example.com).  It would not make sense to score an email just for having 
their name in the subject.
Well, this wouldn't be the first or only rule that doesn't work for everyone... plus, I would 
certainly make it case sensitive, so that "John" wouldn't match "john@", for 
example.  This rule could be disabled by default and turned on by people who want it, or vice 
versa.  I'd also imagine it would generate a lower score from masscheck than the regular TO_IN_SUBJ 
would, and hence would be of less impact towards FPs (but that extra few-tenths of a point could 
make the difference to push a lot of these spams over the threshold, particularly if they hit 
BAYES_999 but not any other rules, as many snowshoe spams often do in the early stages).

And then there are addresses which use normal words in the address which would 
also not make sense to score.  For example: i...@example.com, 
ab...@example.com, supp...@example.com, etc.
Indeed, and those likely-FP words could be explicitly excluded via negative 
match, so that qw(info abuse support mail) etc. wouldn't score.  The same could 
be done for common names, I suppose, although I agree that gets a bit 
cumbersome.

Anyway, it was just a thought... I'd certainly support such a rule, even if it 
had to be manually enabled or rescored.

I don't think it would work as a standard rule. It would have too much variance in the FP rate depending on the email address and trying to maintain a list of problematic words/names would probably be too cumbersome in the general case.

It might work as an informational rule (score 0.001) that admins could use in meta rules or increase scoring on a per-user basis.

--
Bowie

Reply via email to