On 2015-09-04, Adam Wolk <adam.w...@tintagel.pl> wrote: > It's quite possible that Bayesian filtering started working for me only > since this snapshot. I would appreciate it if you could check the size > of your bayes_toks db & some info on general growth per email (seems to > be around 30-60M on my server) as that's the only thing I think could > be wrong with it atm. 65.3G accumulated in less than 24h for a DB that > serves around 11k emails *per month* seems a lot (and most of that > traffic are OpenBSD mailing lists).
That definitely seems wrong, my bayes_toks from 500-1000 mails/day with amavis+spamassassin is around 5MB. I'm not sure where to start looking though, I'd probably try wiping the db and starting again, though the only time I remember having to do that myself is when someone was relaying spam through a host in DNSWL which got auto-learned as ham (i.e bogus data not corruption). $ sudo -u _vscan sa-learn --dump magic 0.000 0 3 0 non-token data: bayes db version 0.000 0 4202 0 non-token data: nspam 0.000 0 1799 0 non-token data: nham 0.000 0 151022 0 non-token data: ntokens 0.000 0 1422052584 0 non-token data: oldest atime 0.000 0 1441425256 0 non-token data: newest atime 0.000 0 1441426503 0 non-token data: last journal sync atime 0.000 0 1441412100 0 non-token data: last expiry atime 0.000 0 0 0 non-token data: last expire atime delta 0.000 0 0 0 non-token data: last expire reduction count