You are into the land of opinions here, so you will get different
answers.
1. The 200 ham and 200 spam is a hard
minimum. You can change this. But don't.
So you MUST give Bayes at least 200 each ham and spam before it will start
doing anything. What you give it for ham should hopefully be fairly
representative of the ham your sites really gets. Likewise the spam should
be moderately representative of average spam.
Once you have the basic stuff I personally prefer to leave auto-learning
turned off and only had Bayes hams and spams that might be misclassified, or
ones where the bayes score isn't high enough in the appropriate direction.
Others may want to do things differently.
Personally I'd say that you REALLY should turn off auto-learning at the
start, until you have got Bayes a good start in life by hand. Once you
have it working and you are happy with it you may want to turn auto-learning
back on, or may not. If you do turn it back on, you probably want to set
bayes-ham-threshold (or whatever the name really is) to around -.1 rather than
the default value.
Loren
|
- Re: Training Bayes properly jdow
- Re: Training Bayes properly jdow
- Re: Training Bayes properly Rick Macdougall
- Re: Training Bayes properly Loren Wilton
- Re: Training Bayes properly Anthony Peacock
- Re: Training Bayes properly Stefan Jakobs
- Re: Training Bayes properly jdow
- RE: Training Bayes properly Will Nordmeyer
- RE: Training Bayes properly Randal, Phil