New spam rule for specific content

Amir 'CG' Caspi Fri, 09 Aug 2013 10:20:07 -0700

Hi all,

A number of my users have been receiving spam formatted in avery specific way which seems to very often miss Bayes... I don'tknow why, whether it's because of the HTML gibberish flooding Bayeswith useless tokens (to reduce the relative strength of the spammytokens), or if it's just the specific content isn't sufficientlyspammy (or has sufficient ham to balance) to pop.Either way, this spam appears to be generated from a specifictemplate, and I've created a rule to hit that template. Within thelast couple of weeks, I've had only true positives and negatives...no FPs, no FNs.


For your perusal, here is the rule:

# Spammy URI pattern
uri __OUTL_URI  /\/outl\b/
uri __OUTI_URI  /\/outi\b/
meta OUTL_OUTI_IS_SPAMMY        (__OUTL_URI && __OUTI_URI)
describe OUTL_OUTI_IS_SPAMMY    /outl + /outi link combo is highly spammy
score OUTL_OUTI_IS_SPAMMY       3

If you don't specifically trust URI rules to not have FPs, I have arawbody version of this which works identically... in all cases, bothrules pop together, so I think there's no specific need to use therawbody version, but I can provide it if needed.


I recommend this rule be added to the general distribution.

(Like many other users here, I've also increased the Bayes scores forBayes99, and created a Bayes999 with even higher scoring... it mightbe time to add that to the general distribution, too.)


Hope this helps...

                                                --- Amir

New spam rule for specific content

Reply via email to