autolearn Re: major upgrade of spamassassin

2011-05-26 Thread Lucio Chiappetti
On Wed, 18 May 2011, Lucio Chiappetti wrote: We are in the process of doing a long awaited and overdue upgrade of our servers (from Suse 9.2 to 11.4), which involves upgrading the (bundled) spamassassin (used with sendmail and amavis milter) from 3.0 to 3.3. We progressed in our migration. We

Large (usually legitimate) HTML mails choking SA

2011-05-26 Thread Kris Deugau
Every so often we get a message or two stuck in our inbound mail queue because it took too long for SA to process during mail delivery. For a little while there were actually runs of pure HTML-garbage spam over 500K; those have been dealt with (and may have disappeared). However, we've just

Re: autolearn Re: major upgrade of spamassassin

2011-05-26 Thread Bowie Bailey
On 5/26/2011 12:57 PM, Lucio Chiappetti wrote: > We progressed in our migration. We have now a machine running with the > new configuration, duly trained on ham and spam collected in the last > weeks, and we diverted to it a fraction of our incoming messages (we > repointed the secondary MX for one

Re: Large (usually legitimate) HTML mails choking SA

2011-05-26 Thread Karsten Bräckelmann
On Thu, 2011-05-26 at 15:02 -0400, Kris Deugau wrote: > Every so often we get a message or two stuck in our inbound mail queue > because it took too long for SA to process during mail delivery. > However, we've just had a couple of *legitimate* messages get stuck for > essentially the same reaso

Re: Large (usually legitimate) HTML mails choking SA

2011-05-26 Thread Alex
Hi, >> Every so often we get a message or two stuck in our inbound mail queue >> because it took too long for SA to process during mail delivery. > >> However, we've just had a couple of *legitimate* messages get stuck for >> essentially the same reason - a whole lot of pathologically bad HTML. >

Re: Large (usually legitimate) HTML mails choking SA

2011-05-26 Thread Martin Gregorie
On Thu, 2011-05-26 at 16:37 -0400, Alex wrote: > Any tips on how to do that? When used in conjunction with amavis, is > there a way to identify which rule consumes the most processing time, > in the same way it can for bayes or SA overall? > By inspection, e.g. any rawbody rule whose regex contain

Re: Large (usually legitimate) HTML mails choking SA

2011-05-26 Thread John Hardin
On Thu, 26 May 2011, Kris Deugau wrote: Whitelisting these once they're found lets them bypass SA altogether, but in the meantime they get stuck in the mail queue. Has anyone got any suggestions for decreasing the load SA imposes trying to process one of these? Any possibility of getting a

Re: Large (usually legitimate) HTML mails choking SA

2011-05-26 Thread RW
On Thu, 26 May 2011 15:02:37 -0400 Kris Deugau wrote: > Every so often we get a message or two stuck in our inbound mail > queue because it took too long for SA to process during mail delivery. > > For a little while there were actually runs of pure HTML-garbage spam > over 500K; those have be