Re: BAYES_999 strange behavior

Kevin A. McGrail Wed, 19 Feb 2014 07:21:21 -0800

On 2/19/2014 9:37 AM, Bowie Bailey wrote:

On 2/18/2014 8:49 PM, Kevin A. McGrail wrote:
On 2/18/2014 6:05 PM, Amir Caspi wrote:
On Feb 18, 2014, at 3:58 PM, John Hardin <jhar...@impsec.org> wrote:
Is there some reason the Bayes scores can't/shouldn't be static?
Indeed, I am wondering why Bayes would be auto-scored at all. Bydefinition, Bayes high scores should match only on spam, low scoresshould match only on ham. That's not perfect, of course, but it isbasically by definition of how Bayes learns.
Given that, it seems to me that the Bayes scores should be static,and my experience suggests that 99 or 999 should be scored prettyheavily. (I'd say 00 should be scored negatively heavily, but I getenough FNs with 00 that I don't like that idea... though it probablymeans my DB is borked or my ham is full of spammy tokens.)
Actually it's a bit the opposite especially if using autolearn where
scoring to high on the 99% end can cause low percentage corpora to swing
heavily towards the high score too rapidly.
Bayes scores are not included when determining what to autolearn, sochanging the Bayes scores should have no effect on autolearning.
Or am I missing something?

I would have to look at the permutations of bayes_auto_learn_on_error,bayes_auto_learn_threshold_spam and the tflag autolearn_force to answerthat question but my memory is that this is a self-perpetuating cyclethat I've seen on live servers when testing scoring.


regards,
KAM

Re: BAYES_999 strange behavior

Reply via email to