On 28-09-2011 13:20, Benny Pedersen wrote:
>> I train Bayes manually on the borderline cases, but also have
>> auto-learning enabled. Is that really a bad idea? Should I disable it,
>> delete the bayes-databases and start over on manual-only learning?
> no training is always good
Are you missing a comma? Do you mean "no, training is always good" or
"no training is always good"?
> what score are you learning on? default is -0.1 and 12.0, I have
> changed them here to -4 and 14
I can't find any settings to that effect, so I guess I am using the defaults.
I have entered your settings in my config now.
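
For reference, a minimal sketch of how those thresholds would look in
local.cf, assuming the standard option names from the AutoLearnThreshold
plugin (the exact lines in my config may differ). Note that the
SpamAssassin 3.3.x configuration documentation lists the nonspam default
as 0.1, not -0.1:

```
# Assumption: auto-learning is left enabled (1 is the documented default).
bayes_auto_learn 1

# Auto-learn as ham only when the message scores below -4
# (documented default: 0.1).
bayes_auto_learn_threshold_nonspam -4.0

# Auto-learn as spam only when the message scores above 14
# (documented default: 12.0).
bayes_auto_learn_threshold_spam 14.0
```

Running "spamassassin --lint" afterwards should show whether the lines
parse cleanly.
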
Looking at
http://spamassassin.apache.org/full/3.3.x/doc/Mail_SpamAssassin_Conf.html#learning_options
I see an option called "bayes_use_hapaxes" that promises significantly
better hit-rates, but also increases database size by a factor of 8 to
10. What is the recommendation on this? If throughput is a factor in
this decision, we are scanning about 60,000 to 90,000 mails a day.
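
As a sketch, and assuming I read that page correctly, the option is a
simple 0/1 switch that is already on by default, so the line in question
would be:

```
# Assumption: 1 = use hapaxes (tokens seen only once) when classifying;
# the documentation lists 1 as the default.
# Setting it to 0 would give up some hit-rate for a smaller database.
bayes_use_hapaxes 1
```
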
> what plugins have you enabled?
DCC
pyzor/razor
SpamCop
AutoLearnThreshold
TextCat
MIMEHeader
ReplaceTags
DKIM
Check
HTTPSMismatch
URIDetail
Bayes
All the EvalTest plugins
VBounce
ImageInfo
FreeMail
> 3rd-party rules or just default SA 3.3.2?
Default and Sought Rules.
--
Lars