From: "Chavdar Videff" <[EMAIL PROTECTED]>

> On Monday 11 July 2005 14:50, JamesDR wrote:
> > Chavdar Videff wrote:
> > > Hi List,
> > >
> > > Our mail server serves about 100 users. Our config:
> > > Sendmail+Procmail+SpamAssassin.
> > > The question is:
> > > If I got it right, we should run sa-learn for each user in order to
> > > benefit from bayes. We intend to run a cron job for each user and do it
> > > at night by supplying a daily snapshot of our spam and ham collections to
> > > sa-learn. Can our mail server handle it (256 MB RAM, Celeron 400 MHz)?
> > > A weekly collection run for 1 user usually eats 100% of CPU load. My
> > > concern is whether the system is going to crash or just do the job slower
> > > and if you can point out how many sa-learn tasks we could run
> > > simultaneously with our setup.
> > > All hints will be appreciated, for we scheduled an initial load for 16
> > > users of the big collection of spam received so far.
> > >
> > > Thanks guys
> > >
> > > Chavdar Videff
> >
> > What kind of Bayes db are you using? We use MySQL here and haven't seen
> > sa-learn use up that much cpu... I've run it manually up to 10 processes
> > at once without any noticeable slowing of the machine. (p2 450mhz, 256mb)
>
> I guess it is BerkeleyDB, the default installation on Debian. The interesting
> part is that while testing cron on one user the cpu fall was not noticeable.

If you are feeding individual per-user Bayes databases, feed each one with the
ham samples and spam samples submitted by that particular user for HER Bayes.
If you have them all working off the same Bayes corpus then there is little
or no gain to using per-user Bayes.

{^_^}
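
P.S. A rough sketch of what "each user's own samples into her own Bayes" could look like from one nightly cron job. This is only an illustration under assumptions, not a tested recipe. The folder names (`mail/spam`, `mail/ham`), the mbox format, and the per-user database prefix `~/.spamassassin/bayes` are all assumed; adjust them to your layout. The function names are made up for the example. The `sa-learn` options used (`--spam`, `--ham`, `--mbox`, `--dbpath`, `--no-sync`, `--sync`) are standard ones. Users are trained one after another under `nice`, so only one sa-learn competes for the CPU at a time.

```python
import subprocess
from pathlib import Path

# Assumed layout: each user sorts her own mail into these mbox folders.
SPAM_FOLDER = "mail/spam"  # hypothetical path, relative to the user's home
HAM_FOLDER = "mail/ham"    # hypothetical path, relative to the user's home


def build_commands(home):
    """Return the sa-learn command lines for one user's home directory."""
    home = Path(home)
    # Assumed per-user Bayes location, in bayes_path (prefix) form.
    dbpath = str(home / ".spamassassin" / "bayes")
    base = ["sa-learn", "--dbpath", dbpath]
    return [
        # Learn this user's spam and ham, deferring the journal sync...
        base + ["--no-sync", "--spam", "--mbox", str(home / SPAM_FOLDER)],
        base + ["--no-sync", "--ham", "--mbox", str(home / HAM_FOLDER)],
        # ...then sync that user's database once at the end.
        base + ["--sync"],
    ]


def learn_all(homes, niceness=19):
    """Train the users sequentially at low priority, never in parallel."""
    for home in homes:
        for cmd in build_commands(home):
            subprocess.run(["nice", "-n", str(niceness)] + cmd, check=True)
```

Calling `learn_all(["/home/alice", "/home/bob"])` from a single cron entry would then work through the users in turn. If this runs as root, the database files it creates may end up owned by root, so running each user's commands under her own account (or fixing ownership afterwards) is probably needed.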