Re: New spam rule for specific content

Amir 'CG' Caspi Sat, 10 Aug 2013 12:47:22 -0700

At 10:41 AM -0700 08/09/2013, John Hardin wrote:

Can you provide a spample or two?

Looks like a similar spam method has come out in recent weeks (sinceJul 30, it seems) that uses slightly different footers... example ishere:


http://pastebin.com/QCmSPzwG

Although running SA on this spam _NOW_ yields a high score beyond thespam threshold, this is almost entirely because additional networktests are now hitting (extra RBLs + Razor). This was not the casewhen the spam was first processed... looks like I was one of theearlier recipients.

For this type, looks like a good match would be on the combo of"/land/" + "/unsub/" + "/report/" ... I have modified my rule fromyesterday as follows:


# Spammy URI patterns
uri __OUTL_URI  /\/outl\b/
uri __OUTI_URI  /\/outi\b/
uri __LAND_URI  /\/land\//
uri __UNSUB_URI /\/unsub\//
uri __REPORT_URI        /\/report\//

meta SPAMMY_URI_PATTERNS ((__OUTL_URI && __OUTI_URI) ||(__LAND_URI && __UNSUB_URI && __REPORT_URI))

describe SPAMMY_URI_PATTERNS    link combos match highly spammy template
score SPAMMY_URI_PATTERNS       3

This modification hits both types of templates. I will very likelybe adding further "spammy patterns" to this rule over time. I'llkeep the list posted if I find some other good ones.

It looks like both this and the previous type of spam are bypassingBayes by embedding images and using no rendered text. Well, not NOtext, but very little, mostly a "successful delivery" message and theunsub/report links. That is, Bayes sees absolutely no "spammy" text,just the image which it cannot decode as spammy.

Are there any rules which can hit on "only embedded images with verylittle text" ?? Not entirely sure how to capture this since it'sdifficult to determine what is "not much" text and there is certainlythe potential for FPs that way (for example, anyone in the designfield sending images to clients without much text, etc.)...

But, these types of spams are bypassing SA consistently, to the tuneof tens per day per user. I would really love a way to stop thembesides hardcoding a rule based on their link syntax, which can beeasily changed during the next iteration of their spam template.

(The HTML comment gibberish rule would be a big step here, sincethat's one of the few things that would distinguish this from ham...unlikely that a real person would embed tens of KB of commentgibberish.)


Thanks.

                                                --- Amir

Re: New spam rule for specific content

Reply via email to