On Wed, 24 Nov 2004, Gustafson, Tim wrote: > How do you keep your ntokens so low? > > Mine averages ((nspam + nham) * 10). Yours is basically (nspam + nham). > Do you run some job that expires tokens or something? I'm running > sa-learn --force-expire once a day (and it takes about 2-3 minutes to > run) but the ntokens never seems to go down. :\ > > Tim
Tim, that's because your Bayes is FUBAR, you've got a "future" message in there that's fouling up your expire. Run it with a '--D' and look at the output, I'll bet that it doesn't actually expire anything. Revisting your '--dump magic' output again: [EMAIL PROTECTED] sa-learn --dump magic 0.000 0 2 0 non-token data: bayes db version 0.000 0 88033 0 non-token data: nspam 0.000 0 15592 0 non-token data: nham 0.000 0 1729756 0 non-token data: ntokens 0.000 0 1010964573 0 non-token data: oldest atime 0.000 0 1762110386 0 non-token data: newest atime 0.000 0 1101309901 0 non-token data: last journalsync atime 0.000 0 1101301792 0 non-token data: last expiry atime 0.000 0 0 0 non-token data: last expire atime delta 0.000 0 0 0 non-token data: last expire reduction count Note that 'newest atime' value, it's 21 years in the future. That is "poisoning" your expire, so it's not doing anyting. perl -e 'print scalar localtime(1762110386),"\n";' Sun Nov 2 13:06:26 2025 The 'ntokens' should be a more-or-less fixed value, based upon the setting of your "bayes_expiry_max_db_size". The values of nspam & nham should continually increase but ntokens should hit an upper bound and go no higher. On a busy system that's been running for a while, nspam & nham can easily out strip ntokens. Here's my stats: server15$ sa-learn --dump magic 0.000 0 2 0 non-token data: bayes db version 0.000 0 1275494 0 non-token data: nspam 0.000 0 525068 0 non-token data: nham 0.000 0 227192 0 non-token data: ntokens 0.000 0 1101252542 0 non-token data: oldest atime 0.000 0 1101360564 0 non-token data: newest atime 0.000 0 1101360564 0 non-token data: last journal sync atime 0.000 0 1101338998 0 non-token data: last expiry atime 0.000 0 86400 0 non-token data: last expire atime delta 0.000 0 73470 0 non-token data: last expire reduction count -- Dave Funk University of Iowa <dbfunk (at) engineering.uiowa.edu> College of Engineering 319/335-5751 FAX: 319/384-0549 1256 Seamans Center Sys_admin/Postmaster/cell_admin Iowa City, IA 52242-1527 #include <std_disclaimer.h> Better is not better, 'standard' is better. B{