Re: Understanding ruleQA results

RW Tue, 14 Aug 2018 10:39:27 -0700

On Tue, 14 Aug 2018 11:38:27 -0400
micah anderson wrote:

> Hi,
> 
> I'm trying to understand the ruleQA results because I'm trying to
> track down how common the rule FRNAME_IN_MSG_NO_SUBJ is spammy.
> 
> I load the latest rules:
> http://ruleqa.spamassassin.org/20180813-r1837926-n/FRNAME_IN_MSG_NO_SUBJ/detail?s_corpus=1&s_g_over_time=1#overtime
> 
> and I see the S/O value is 1.0, which is a rule that hits only on spam
> (a rule that only hits on ham is 0.0, a rule that doesn't anything is
> 0.5)... but how can I tell how many messages are part of the corpus?



'mouseover' the percentages

 
> Also, the percentages seem very low: 1.5192% Spam, and .0005%
> Ham... 1.5% seems low to me to be adding 3.5 score to this rule,

 

The only reason that that might be a problem is if all the hits
occurred in a single short period, which would suggest it's a property
of a single spam run. Other than that the scores come from an
optimization process. You can see why it gets a large score just by
looking at the score-map section.

Re: Understanding ruleQA results

Reply via email to