Custom rule aware of occurrences

Bert Van de Poel Sun, 15 Sep 2019 19:54:03 -0700

Dear fellow SpamAssassin users,

I'm contacting you as a member of ULYSSIS. ULYSSIS is a student non-profit organisation at the University of Leuven trying to make computers and technology more approachable and available to students. As part of this objective, we run a hosting service within our university's network for student organisations, student unions and individuals at our university.

We've battled with spam from time to time, since we seem to attract a lot of spam in exotic languages, which is rather well able to circumvent commonly used methods. This has had us resort to some custom rulesets to battle mostly targeted French and SEO spam, often coming from very respectable servers and very normal addresses.

Now, because SEO spam specifically has been adapting quite well to any rule we think of (finding alternative ways of saying the same thing time and time again), I was hoping to write a rule that basically boils down to "give some spam score to emails that contain the word SEO 3 or more times", to push those already being detected by other rules over the edge. To be clear, this will be a low-score rule; I'm aware that ham can perfectly well contain that word 3 times, just like this email for example. While investigating, I started wondering how to handle the fact that some spam will have just a plain-text body while other spam will also include an HTML part, which means the count may suddenly double or halve. Beyond that, it seems quite hacky to use a regex that boils down to something like /\bSEO\b.*\bSEO\b.*\bSEO\b/i instead of something that is properly aware of the count of certain words.
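To make the intent concrete, here is a minimal sketch in Python of the counting behaviour I have in mind. It is not SpamAssassin rule syntax; the function names and the sample texts are made up purely for illustration, and the threshold of 3 is just the value mentioned above.

```python
import re

# Illustration only: hypothetical helpers, not SpamAssassin syntax.
# Matches the standalone word "SEO", case-insensitively.
SEO_WORD = re.compile(r"\bSEO\b", re.IGNORECASE)


def count_seo(text: str) -> int:
    """Return how many times the standalone word SEO occurs in text."""
    return len(SEO_WORD.findall(text))


def seo_threshold_hit(text: str, threshold: int = 3) -> bool:
    """True when the word occurs at least `threshold` times."""
    return count_seo(text) >= threshold


# Made-up sample: the same message as plain text only, and as
# plain text followed by the text of an equivalent HTML part.
plain = "Cheap SEO for your site. Our SEO experts await."
plain_and_html = plain + "\n" + plain

print(count_seo(plain))                   # 2
print(seo_threshold_hit(plain))           # False
print(count_seo(plain_and_html))          # 4
print(seo_threshold_hit(plain_and_html))  # True
```

The last two lines show the text/text+html concern: the same message crosses the threshold only because its content is counted twice.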

Since I sort of expected SpamAssassin to have a solution for both the text/text+html and the counting problems, I asked around on IRC but was pointed here. So, any suggestions or pointers are more than welcome. I'm not sure whether any more information is required, but feel free to ask questions or correct my assumptions if necessary.


Kind regards,
Bert Van de Poel
ULYSSIS
University of Leuven
