Re: [SAtalk] Max children

Jack Gostl Sun, 22 Jun 2003 18:52:13 -0700

On Mon, 23 Jun 2003, Simon Byrnand wrote:

> At 20:54 22/06/03 -0400, Jack Gostl wrote:
> 
> 
> >On Mon, 23 Jun 2003, Simon Byrnand wrote:
> >
> > > At 08:38 18/06/03 -0400, Jack Gostl wrote:
> > >
> > > >I've seen this two or three times now, and I'm not sure what to make of
> > > >it.
> > > >
> > > >Outward appearance is that we get hit with a ton of spam, or perhaps that
> > > >an RBL goes out. I wind up with many copies of spamd running, many more
> > > >than the -m parameter should allow. (And forget about procmail and
> > > >sendmail. Hundreds of copies.) They never seem to go away, or at best, go
> > > >away very, very slowly.
> > >
> > > Yep, seen that before we upgraded our server recently. Basically when a
> > > flood of junk comes in, your server is trying to launch more copies of
> > > sendmail/procmail/spamd than it can handle, it runs out of memory, starts
> > > swapping, and gets bogged down. The more it swaps, the more it falls 
> > behind
> > > the incomming connection rate, the more processes that try to run at once,
> > > so the situation compounds itself. If you're unlucky it will not 
> > recover by
> > > itself.
> > >
> > > >The monitors show that disk activity is through the roof.
> > >
> > > Yep, and if you use vmstat you should see that it is swapping activity (si
> > > and so columns) that are the primary cause of the disk activity.
> > >
> > > >They all but cripple the machine. Then I go in and kill sendmail, kill all
> > > >the procmails and spamds, and restart things, and everything clear up very
> > > >quickly.
> > >
> > > Of course, because when you kill all those processes you free up physical
> > > memory and the machine is no longer in a state of swap thrashing, so it
> > > becomes responsive again almost immediately.
> > >
> > > >  So quickly that it makes me suspicious as to what was REALLY
> > > >wrong. Maybe something more than just load, or RBL problems. Maybe a
> > > >locking problem.
> > >
> > > Nope, this is a problem with stock installs of sendmail/procmail and/or a
> > > lack of memory. How much memory does the server have ? Roughly how many
> > > messages a day do you process ?
> >
> >We have 128mb of real memory, and we do around 4500 messages per day.
> 
> Hmm ok, 128MB of memory is a bit marginal in my experience. Increasing that 
> to 256 or 512 would probably make a BIG improvement. To give you an idea we 
> do around 8000-12000 messages a day, and our old server was a PIII-800 with 
> 128MB of ram. We also do virus scanning of email.
> 
> When I first introduced SpamAssassin I had the same problems as you - it'd 
> go fine for a while and then suddenly the machine would bog down to the 
> point where you could hardly even log in. Without SpamAssassin the machine 
> was easily up to the task of that volume of mail.
> 
> Now the machine is a P4 2.4Ghz with 1GB of ram and it absolutely flies :) 
> Although the CPU and disk access are a lot faster too, I attribute a lot of 
> that to having plenty of free ram available at all times, and spare ram is 
> automatically put to use for disk caching...
> 
> > > In your sendmail.cf you'll want to tune:
> > >
> > > QueueLA
> > > RefuseLA
> > > MaxDaemonChildren
> > > ConnectionRateThrottle
> >
> >Do you have any suggested values? And what version of sendmail are you
> >assuming. I don't see all those values. I have QueueLA and RefuseLA but
> >not the other two. I'm at sendmail 8.10.1.
> 
> Well I'm running 8.11.6. On the old machine with SA installed I ended up using
> 
> QueueLA=10
> RefuseLA=20
> MaxDaemonChildren=50
> ConnectionRateThrottle=10
> 
> On the new server I'm now using
> 
> QueueLA=15
> RefuseLA=25
> MaxDaemonChildren=100
> ConnectionRateThrottle=20
> 
> 
> > > of course what you tune them to depends on the specs of your server.
> >
> >We're running AIX. A dual processor but not a particularly fast
> >server. Never had any problems until I introduced SpamAssassin.
> >
> > > Using the -m option of spamd IMHO is a bad idea and compounds the problem,
> > > because it causes procmail and sendmail processes to *WAIT* when there is
> > > too much incomming stuff to handle at once, and all these processes 
> > waiting
> >
> >I'm beginning to agree.
> 
> With my script (snipped) on the old server I found it necessary to defer 
> scanning if the Load average was higher than 5, and if more than 15 local 
> deliveries were happening at once. Now on the new server I have the load 
> average threshold at 15 and 30 simultaneous deliveries. So far I've seen no 
> cases of bogging down.


Well... there is a new server on the way. Lots bigger and faster than this
one. I just need to get through a few weeks.

I'll try the sendmail throttling you suggested.

-- 

Jack Gostl      [EMAIL PROTECTED]



-------------------------------------------------------
This SF.Net email is sponsored by: INetU
Attention Web Developers & Consultants: Become An INetU Hosting Partner.
Refer Dedicated Servers. We Manage Them. You Get 10% Monthly Commission!
INetU Dedicated Managed Hosting http://www.inetu.net/partner/index.php
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Re: [SAtalk] Max children

Reply via email to