On Tue, Jun 05, 2012 at 11:39:29AM -0400, Kevin A. McGrail wrote:
> A) These are just sub rules for use in a meta.  As a specialist in
> meta rules, just because you hit a sub rule doesn't matter.  What
> matters is if it triggers a scoring rule.  Does it?
> 
> B) I don't recognize those rules or know where they came from.
> Where did they come from?
> 

The scoring rule is 4.0 JM_SOUGHT_3, which is one of the "sought
channel" rules distributed (and regularly updated) by the
sought.rules.yerp.org channel in SpamAssassin [1].

That link is a little dated, but the channel is not. It comes stock now
with `yum install spamassassin` on RHEL 6, and can be added to a local
installation of SA by following the instructions in the link above. The
specific path for my vanilla install is:

/var/lib/spamassassin/3.003002/sought_rules_yerp_org/20_sought.cf

As far as I can tell (admittedly, I haven't studied source), it's simply
doing regex matching on a variety of spammy content. Nothing terribly
sophisticated -- the pattern matching is straight up "does this exact
string exist?" The problem is it's picked up artifacts of CKEditor, a
common CRM/CMS editor. I was able to demonstrate the problem using
CKEditor's demo page [2], and posted the SO question Brett cited earlier
[3].

One option for us would be to disable the WYSIWYG, but I can't imagine
we're the only ones affected. The CKEditor user page lists a variety of
large companies and bulk email providers, including MailChimp [4].

[1] http://taint.org/2007/08/15/004348a.html
[2] http://ckeditor.com/demo
[3] 
http://stackoverflow.com/questions/10890407/ckeditors-html-artifacts-trigger-spamassassin-can-you-turn-ckeditors-html-mod
[4] http://ckeditor.com/who-is-using-ckeditor

--
Christopher Tiwald

Reply via email to