Re: Shortcircuit Question

Clayton Keller Thu, 08 May 2008 19:43:43 -0700

Clayton Keller wrote:

Justin Mason wrote:
Clayton Keller writes:
Justin Mason wrote:
Matt Kettler writes:
Clayton Keller wrote:
I have been reading through the Shortcircuit manpage as well as some articles within the Wiki, and the manner in which I see it performing within our install does not seem to coincide with how I am reading and presumably understanding it to work.
First off, we are using SpamAssassin 3.2.4 provided by the rpmforge batch of RPMs.
I have about a dozen priorities specified, mainly handling URIBL, SURBL, as well as DCC, Razor, Pyzor and Bayes.
What I am seeing is that although the first shortcircuit rule hits and scores appropriately, subsequent shortcircuit rules continue to fire. Their scores are then totaled along with the original scores for the rules.
My understanding of how shortcircuit should work is that once a shortcircuit rule is triggered, any subsequent rules should be bypassed and the message either classified as spam/ham or, if set to on, scored using the current score specified for the rule.
As a for instance:

I have the following:

priority URIBL_BLACK            -500
priority URIBL_JP_SURBL         -498
priority URIBL_SC_SURBL         -488
priority URIBL_OB_SURBL         -487

priority SC_URIBL_SURBL         -480
priority SC_URIBL_SBL           -479

priority RAZOR2_CHECK           -450
priority DCC_CHECK              -449
priority PYZOR_CHECK            -448

priority SC_URIBL_HASH          -440

meta SC_URIBL_SURBL    (URIBL_BLACK && (URIBL_SC_SURBL || URIBL_JP_SURBL || URIBL_OB_SURBL))
meta SC_URIBL_SBL      ((URIBL_BLACK || URIBL_SC_SURBL || URIBL_JP_SURBL || URIBL_OB_SURBL) && URIBL_SBL)
meta SC_URIBL_HASH     ((URIBL_BLACK || URIBL_SC_SURBL || URIBL_JP_SURBL || URIBL_OB_SURBL) && (RAZOR2_CHECK || DCC_CHECK || PYZOR_CHECK))

shortcircuit SC_URIBL_SURBL             spam
shortcircuit SC_URIBL_SBL               spam
shortcircuit SC_URIBL_HASH             spam

score SC_URIBL_SURBL            100.00
score SC_URIBL_HASH             100.00
score SC_URIBL_SBL              100.00
I do not have a recent debug to show, but I can say that in the debug output I do see SC_URIBL_SURBL trigger after the earlier priority rules are run. However, the remaining priorities are then run, and if the criteria are met, the RAZOR, DCC, and PYZOR rules run and then the SC_URIBL_HASH rule triggers. This gives a total score of 200 plus the scores for the URIBL/SURBL rules that hit and, if they hit, the Razor, DCC, and Pyzor scores as well.
I was thinking that after SC_URIBL_SURBL was triggered, the remaining rules would not run and the spam classification would take precedence.
Am I overlooking the obvious, have I misunderstood how the SC should work, or is it something with the rpm that was released by rpmforge? Any thoughts or insight would be appreciated.
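
The behavior expected above can be sketched as a toy model. This is only an illustration of the semantics being asked about, not SpamAssassin's actual implementation: the function name `evaluate`, the precomputed `hits` set, and the scores of 2.0 and 1.0 are all hypothetical, and meta rule dependencies are not modeled.

```python
# Toy model of priority-ordered rule evaluation with shortcircuiting.
# NOT SpamAssassin code; every name and score here is a hypothetical example.

def evaluate(rules, hits, shortcircuit):
    """Return (fired rule names, total score).

    rules:        dict of rule name -> (priority, score)
    hits:         set of rule names that would match the message
    shortcircuit: set of rule names that should end the scan when they hit
    """
    total = 0.0
    fired = []
    # Lower priority values run first, as with the negative priorities above.
    for name, (priority, score) in sorted(rules.items(), key=lambda kv: kv[1][0]):
        if name not in hits:
            continue
        total += score
        fired.append(name)
        if name in shortcircuit:
            break  # expected behavior: every remaining rule is skipped
    return fired, total


rules = {
    "URIBL_BLACK": (-500, 2.0),       # placeholder score
    "URIBL_JP_SURBL": (-498, 2.0),    # placeholder score
    "SC_URIBL_SURBL": (-480, 100.0),
    "RAZOR2_CHECK": (-450, 1.0),      # placeholder score
    "SC_URIBL_HASH": (-440, 100.0),
}
hits = set(rules)  # assume every rule would match this message
shortcircuit = {"SC_URIBL_SURBL", "SC_URIBL_HASH"}

fired, total = evaluate(rules, hits, shortcircuit)
print(fired, total)
```

Under this model the scan stops at SC_URIBL_SURBL, so RAZOR2_CHECK and SC_URIBL_HASH never contribute, which is the opposite of the 200+ total described above.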
SA is, rather fortunately, circumventing what you're trying to do because of how DNS is handled internally.
DO NOT try to split up the priority of DNS based tests. Priority and shortcircuiting are intended to be used on *fast* rules, not slow ones.
If you were successful, you would make the performance of SpamAssassin absurdly slow by serializing DNS queries. *OUCH*. SA normally runs these in parallel, and running them in serial would very seriously impact performance.
Currently, all DNS based tests "run" at their priority, but that only launches the DNS queries. All the results are gathered together at HARVEST_DNSBL_PRIORITY, which is currently set to 500. None of the rules will actually trigger until this point.
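
The cost of serializing lookups can be illustrated with a back-of-the-envelope model. This is a rough sketch only: the latencies are made-up values in milliseconds, the helper names are hypothetical, and real resolvers add timeouts and caching that are ignored here.

```python
# Rough model of total wait time for a batch of DNS lookups.
# The latencies are invented example values in milliseconds.

def serial_wait(latencies_ms):
    """Each query starts only after the previous one has finished."""
    return sum(latencies_ms)


def parallel_wait(latencies_ms):
    """All queries are launched together; the wait is the slowest one."""
    return max(latencies_ms, default=0)


lookups_ms = [120, 80, 200, 150]  # hypothetical per-query latencies

print("serial:  ", serial_wait(lookups_ms), "ms")
print("parallel:", parallel_wait(lookups_ms), "ms")
```

With these example numbers the serial case waits for the sum of all lookups while the parallel case waits only for the slowest one, and the gap widens with every additional DNS based rule.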
actually Matt, you're wrong ;)  if some of the network rules are
at a higher priority than others, and are used in shortcircuit rules,
SpamAssassin 3.2.x will indeed sleep until the results of those rules
arrive.

The idea is that, if you have the memory to support that degree of
concurrency, you can make a local policy decision to do that, instead
of doing the lookups at the MTA level which does effectively the
same thing.

This wait is logged, so you can spot it with --debug on.

--j.
With that said, Justin, is the behavior I am seeing correct? Even though the first prioritized shortcircuit rule hits, and I see that in the debug log, shouldn't it be bypassing the remaining rules rather than continuing to process until all the shortcircuit priorities have run?
From reading the initial bug when this feature was originally proposed, along with the man page, as well as a wiki post with an example by you, that is how I understood it to function.
actually, no, it sounds like a bug.    could you open a bugzilla with
a demonstration config/test message?

--j.
Just to add, from my previous debug I can confirm the waiting for the tests to finish, as you mentioned. I'll make sure this is included when the bug is filed.
Also, what happened to independently enabling/disabling autolearning via tflags? In a previous post a while back, I was informed that messages hit by shortcircuit rules were not learned as either ham or spam based on their results, in part to preserve the performance gains by avoiding the extra Bayes processing that learning requires. Looking through the original patch thread, it appears this was a part of the discussion to some degree, but I was just wondering what the final say on that was as well.
Clay


Bug 5906 submitted. Thanks for your help on this.

Clay
