On 01/05/13 19:40, Andrew Talbot wrote:
Hi, Seve -
Thanks for your response. Is that just for performance reasons?
Performance is one of the things that bayes_auto_learn_on_error 1 will
give you. It means that if the message was already considered spam by
Bayes, then the message won't be autolearnt again which means a bit less
IO. It will also result in the Bayes databases being smaller as it is
likely that with this option that less tokens will be present overall
which will also save disk IO and space.
But the key reason I like this option is that it doesn't allow bayes to
overtrain in one direction (e.g. spam or ham). It only autolearns when
Bayes either has the wrong result or isn't sure which IMO has to be
better for accuracy in the long run.
Kind regards,
Steve.