On 11/12/2017 05:35 PM, micah wrote:
David Jones <djo...@ena.com> writes:
I am interested in seeing the bayes info in the database, because it was
created years ago
Spam changes all of the time so I train mine daily and manually expire
mine after about a month. Depending on your recipients, number of
mailboxes, and mail flow, you may be fine with not training that often
but I don't think tokens from years ago are going to be very accurate on
current mail flow.
A large list of whitelist_auth entries with well-trained Bayes and you
can bump up the BAYES_* scores with nice results.
How do you deal with a large user base with bayesian databases? It seems
like having a shared one just gets useless fast, but allocating an
individual database to each user is quite a hassle as well.
Servicing +40k users I use a central Redis based Bayes db in autolearn
mode. Works for me.