On Tue, 14 Aug 2018 11:38:27 -0400 micah anderson wrote: > Hi, > > I'm trying to understand the ruleQA results because I'm trying to > track down how common the rule FRNAME_IN_MSG_NO_SUBJ is spammy. > > I load the latest rules: > http://ruleqa.spamassassin.org/20180813-r1837926-n/FRNAME_IN_MSG_NO_SUBJ/detail?s_corpus=1&s_g_over_time=1#overtime > > and I see the S/O value is 1.0, which is a rule that hits only on spam > (a rule that only hits on ham is 0.0, a rule that doesn't anything is > 0.5)... but how can I tell how many messages are part of the corpus?
'mouseover' the percentages > Also, the percentages seem very low: 1.5192% Spam, and .0005% > Ham... 1.5% seems low to me to be adding 3.5 score to this rule, The only reason that that might be a problem is if all the hits occurred in a single short period, which would suggest it's a property of a single spam run. Other than that the scores come from an optimization process. You can see why it gets a large score just by looking at the score-map section.