Re: Shortcircuit Question

Clayton Keller Thu, 08 May 2008 09:33:53 -0700

Justin Mason wrote:

Clayton Keller writes:
Justin Mason wrote:
Matt Kettler writes:
Clayton Keller wrote:
I have been reading throught the Shortcircuit manpage as well as somearticles within the Wiki, and the manner in which I see it performingwithin our install does not seem to coincide with how I am reading andpresumably understanding it to work.
First off, we are using SpamAssassin 3.2.4 provided by the rpmforgebatch of rpm's.
I have about a dozen priorities specified mainly handling URIBL,SURBL, as well as DCC, Razor, Pyzor and Bayes.
What I am seeing is that although the first shortcircuit rule hits andscores appropriately. Subsequent short circuit rules will continue tofire. The scores themselves are then totaled along with the originalscores for the rules.
My understanding of how the shortcircuit should work is that once ashortcircuit is triggered any subsequent rules should be bypassed andthe message wither classified as spam/ham or if set to on, it woulduse the current score specified for the rule.
As a for instance:

I have the following:

priority URIBL_BLACK            -500
priority URIBL_JP_SURBL         -498
priority URIBL_SC_SURBL         -488
priority URIBL_OB_SURBL         -487

priority SC_URIBL_SURBL         -480
priority SC_URIBL_SBL           -479

priority RAZOR2_CHECK           -450
priority DCC_CHECK              -449
priority PYZOR_CHECK            -448

priority SC_URIBL_HASH          -440

meta SC_URIBL_SURBL    (URIBL_BLACK && (URIBL_SC_SURBL
                           || URIBL_JP_SURBL || URIBL_OB_SURBL))

meta SC_URIBL_SBL    ((URIBL_BLACK || URIBL_SC_SURBL ||
URIBL_JP_SURBL || URIBL_OB_SURBL) &&URIBL_SBL)
meta SC_URIBL_HASH    ((URIBL_BLACK || URIBL_SC_SURBL ||
URIBL_JP_SURBL || URIBL_OB_SURBL) &&(RAZOR2_CHECK || DCC_CHECK || PYZOR_CHECK))
meta SC_URIBL_SBL       ((URIBL_BLACK || URIBL_SC_SURBL ||
URIBL_JP_SURBL || URIBL_OB_SURBL) &&URIBL_SBL)
shortcircuit SC_URIBL_SURBL             spam
shortcircuit SC_URIBL_SBL               spam
shortcircuit SC_URIBL_HASH             spam

score SC_URIBL_SURBL            100.00
score SC_URIBL_HASH             100.00
score SC_URIBL_SBL              100.00
I do not have a recent debug to show but I can say that from the debugI do see the SC_URIBL_SURBL trigger, after the earlier priority rulesare ran. However, the remaining priorities are then ran, and ifmeeting critera, the RAZOR, DCC, and PYZOR rules run and then theSC_URIBL_HASH rule would trigger. Thus giving a total score of 200 +the scores for the URIBL/SURBL scores that hit and if included theRazor, DCC, and Pyzor scores as well.
I was thinking after the SC_URIBL_SURBL was triggered remaining ruleswould not run, and the spam classification would take precendence.
Am I overlooking the obvious, have I misunderstood how the SC shouldwork, is it something with the rpm that was released by rpmforge? Anythoughts or insight would be appreciated.
SA is, rather fortunately, circumventing what you're trying to dobecause of how DNS is handled internally.
DO NOT try to split up the priority of DNS based tests. Priority andshortcircuiting is intended to be used on *fast* rules, not slow ones.
If you were successful, you would make the performance of SpamAssassinabsurdly slow by serializing DNS queries. *OUCH*. SA normally runs thesein parallel, and running them in serial would very seriously impactperformance.
Currently, all DNS based tests "run" at their priority, but that onlylaunches the DNS queries. All the results are gathered together atHARVEST_DNSBL_PRIORITY, which is currently set to 500. None of the ruleswill actually trigger until this point.
actually Matt, you're wrong ;)  if some of the network rules are
at a higher priority than others, and are used in shortcircuit rules,
SpamAssassin 3.2.x will indeed sleep until the results of those rules
arrive.

The idea is that, if you have the memory to support that degree of
concurrency, you can make a local policy decision to do that, instead
of doing the lookups at the MTA level which does effectively the
same thing.

This wait is logged, so you can spot it with --debug on.

--j.
With that said Justin, is the behavior I am seeing correct? Even thoughthe first prioritized shortcircuit rule hits, and I see that in thedebug log, shouldn't it be bypassing the remaining rules rather thancontinuing to process until all the shortcircuit priorities have ran?
From reading the initial bug when this was originally featured, alongwith the man page, as well as a wiki post with an example by you, thatis how I understood it to function.
actually, no, it sounds like a bug.    could you open a bugzilla with
a demonstration config/test message?

--j.

Just to add, with my previous debug, I can confirm the waiting of thetests to finish as you mentioned from the debug. I'll make sure this isincluded when the bug is filed.

Also, what happened to independently enabling/disabling autolearning viatflags? From a previous post a while back, I was informed that due toperformance gains shortcircuit rules were not learned as either ham orspam due to their results due in part to creating additional performancegains with the extra bayes processing that is required. Looking throughthe original patch thread it appears this was a part of the discussionto some degree, but I was just wandering what the final say on that wasas well.


Clay

Re: Shortcircuit Question

Reply via email to