Christopher Jett wrote:
>> For the autowhitelist database, grab the check_whitelist script out of
>> the tools subdirectory in the tarball. Run check_whitelist --clean on
>> the AWL file. This will eliminate any "one-off" entries from it. Not
>> much of an expiry, but it's a start. (note: the lack of any reasonable
>> form of expiry is one reason why I say the AWL isn't really ready for
>> production use on any servers that have decent mail volume)
>
> OK - thanks. So, for example, it's safe to delete the bayes_seen file
> after it gets over a certain size? Is there a particular size after
> which performance degrades significantly?

No, there's no "cliff" in it. It should be something like O(n) or O(n log(n)), i.e. the bigger bayes_seen gets, the larger the database to search when learning, so the longer learning takes. There's no particular size where it suddenly falls over, though; the relationship between size and speed is probably linear or close to it.
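
To put rough numbers on the "no cliff" point, here is a small Python sketch of the two growth models mentioned above. It is not a measurement of SpamAssassin: the function names and the entry counts are made up for illustration, and the cost units are arbitrary. It only shows that under either model, doubling the size of the file roughly doubles the work, whatever size you start from.

```python
import math

def relative_cost(n, model):
    """Abstract cost of searching n entries under a given growth model.

    Hypothetical helper for illustration; units are arbitrary.
    """
    if model == "linear":
        return float(n)
    if model == "nlogn":
        return n * math.log2(n)
    raise ValueError(f"unknown model: {model}")

def doubling_ratios(sizes, model):
    """How much more work a file of 2*n entries costs than one of n entries."""
    return [relative_cost(2 * n, model) / relative_cost(n, model) for n in sizes]

# Assumed entry counts, chosen only to span a few orders of magnitude.
sizes = [10_000, 100_000, 1_000_000]

linear_ratios = doubling_ratios(sizes, "linear")
nlogn_ratios = doubling_ratios(sizes, "nlogn")

for n, lin, nln in zip(sizes, linear_ratios, nlogn_ratios):
    print(f"{n:>9} entries: doubling costs x{lin:.2f} (linear), x{nln:.2f} (n log n)")
```

The ratio stays at 2 for the linear model and only slightly above 2 for n log(n), and it does not jump at any particular size. That is all "no cliff" means here: the slowdown is gradual, so any size limit you pick is a judgment call rather than a threshold dictated by the data structure.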