On 2015-09-04, Adam Wolk <adam.w...@tintagel.pl> wrote:
> It's quite possible that Bayesian filtering started working for me only
> since this snapshot. I would appreciate it if you could check the size
> of your bayes_toks db & some info on general growth per email (seems to
> be around 30-60M on my server) as that's the only thing I think could
> be wrong with it atm. 65.3G accumulated in less than 24h for a DB that
> serves around 11k emails *per month* seems a lot (and most of that
> traffic are OpenBSD mailing lists).

That definitely seems wrong, my bayes_toks from 500-1000 mails/day with
amavis+spamassassin is around 5MB. I'm not sure where to start looking
though, I'd probably try wiping the db and starting again, though the
only time I remember having to do that myself is when someone was relaying
spam through a host in DNSWL which got auto-learned as ham (i.e bogus data
not corruption).

$ sudo -u _vscan sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0       4202          0  non-token data: nspam
0.000          0       1799          0  non-token data: nham
0.000          0     151022          0  non-token data: ntokens
0.000          0 1422052584          0  non-token data: oldest atime
0.000          0 1441425256          0  non-token data: newest atime
0.000          0 1441426503          0  non-token data: last journal sync atime
0.000          0 1441412100          0  non-token data: last expiry atime
0.000          0          0          0  non-token data: last expire atime delta
0.000          0          0          0  non-token data: last expire reduction 
count

Reply via email to