Re: timeouts on processing some messages, started October 24

Bill Cole Wed, 03 Nov 2021 06:32:39 -0700

On 2021-11-02 at 19:15:33 UTC-0400 (Tue, 02 Nov 2021 19:15:33 -0400)
Greg Troxel <g...@lexort.com>
is rumored to have said:

I have a systeem with postfix and spamassassin 3.4.6 via spamd.  It's
been generally running well.  I noticed mail from one of my other
systems timing out and 471, and that caused me to look at the logs.  I
have KAM rules, some RBL adjustments, a bunch of local rules for my
spam, but really nothing I consider unusual.

[...]

and thus I have two problems:
need to have postfix delay be more than spamassassin delay plusrounding

It would generally be a bad idea to increase the Postfix timeout, asthat passes the problem back upstream as senders will generally time outat 300s as well.

So, add '--timeout-child=295' to your spamd arguments if you want tomake spamd timeout faster than Postfix reliably.

  need to figure out why there is a timeout


That's the important part.

The first is surely manual reading, but I wonder why it isn't default.

We don't try very hard to guess what users will want in the integrationdetails between SA and the tools like MTAs that use it. 300s is the SMTPdefault timeout at end-of-data, which presumably is why it is spamd'sdefault. I think it makes sense to reduce that for most circumstances,but I'm a bit hesitant to do so in the distribution because there couldbe people relying on the specific idiosyncratic behavior of spamd timingout after its caller has given up rather than before.

On the second, I wonder if anyone else is seeing this, and cluesappreciated.

I have no recent SA timeouts logged recently on any of the systems Imanage.

The most common reason for SA to hit its internal timeout is thecombination of a rule with a pattern that can generate a large number ofbacktracks while scanning (exponential or factorial order) and a messagewhich causes such backtracking. Typically that's caused by a '*' or '+'in a pattern where a fixed range for the number of repeats should beused instead. A few years ago we tried to fix all cases of dangerousrules in the default ruleset, and I think we succeeded. I believe theKAM rules have also been audited for likely problems. If you have anyunbounded wildcards in your local rules, tightening those rules upshould be your first step. If you can't find and fix the problematicrule by eye, you can get clues about it by scanning a problematicmessage with the "-D all" option to get a detailed rundown of what SAdoes in scanning a message. That will show you what rules are checkedsuccessfully. You can find a problematic rule by comparing that debugoutput from a bad message to that of a message which doesn't hang SA.



--
Bill Cole
b...@scconsult.com or billc...@apache.org
(AKA @grumpybozo and many *@billmail.scconsult.com addresses)
Not Currently Available For Hire

Re: timeouts on processing some messages, started October 24

Reply via email to