Paul Boven wrote:
Hi everyone,
The Message-IDs of mails that have been (auto-)learned by Bayes are
stored indefinitely in bayes_seen. Over the years that we've used
SpamAssassin, that file has grown to 320MB. We're using site-wide Bayes
databases. What would be the best way to trim this database down safely?
Given that it only stores the Message-ID and spam status, I assume there is
no way to keep just the more recent entries, and I'd have to wipe it altogether?
No replies yet, so I'll clarify my question a bit:
1.) How much of a performance impact does a bayes_seen file this large
have?
2.) What is the safest way of trimming it down? Can I simply stop
SpamAssassin (called by MIMEDefang in our case) and remove the file, or do I
need to recreate it in some way?
It would perhaps be useful if the bayes_seen database also stored
timestamps, so this kind of purging could be done automatically and
properly.
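To illustrate what I mean by timestamps, here is a rough Python sketch. It is purely hypothetical and does not reflect how SpamAssassin actually stores bayes_seen; the class SeenStore and its methods are names I made up for the example. The idea is just that if each Message-ID carried the time it was learned, old entries could be expired by age:

```python
import time


class SeenStore:
    """Hypothetical seen-store: Message-ID -> (status, learned_at).

    Not SpamAssassin's real format; it only illustrates age-based purging.
    """

    def __init__(self):
        self._entries = {}

    def record(self, message_id, status, now=None):
        # Remember that this message was learned, and when.
        learned_at = time.time() if now is None else now
        self._entries[message_id] = (status, learned_at)

    def status(self, message_id):
        # Return the stored status, or None if the message is unknown.
        entry = self._entries.get(message_id)
        return entry[0] if entry else None

    def purge_older_than(self, max_age_seconds, now=None):
        # Drop every entry learned before the cutoff; return how many went.
        now = time.time() if now is None else now
        cutoff = now - max_age_seconds
        stale = [mid for mid, (_, ts) in self._entries.items() if ts < cutoff]
        for mid in stale:
            del self._entries[mid]
        return len(stale)
```

With something like this, a periodic job could call purge_older_than() with, say, a one-year window, and the recent entries would survive.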
Regards, Paul Boven.