On 03.07.18 12:17, J Doe wrote:
> From reading the documentation, it appears that to train the Bayesian
> filter I require a minimum of 1,000 pieces of ham and 1,000 pieces of
> spam.

No. You need at least 200 hams and 200 spams for Bayes to start firing, but
you can tune that by setting bayes_min_ham_num and bayes_min_spam_num.
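
For example, a minimal sketch for local.cf that lowers both minimums (the
value 100 is purely illustrative, not a recommendation; 200 is the default):

   bayes_min_ham_num 100
   bayes_min_spam_num 100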

Note that training on too few mails can result in false positives and false
negatives.

> I am currently collecting spam on one of my servers via a spam trap
> address and slowly reaching that number.  I was wondering, though, if I
> can use auto learning (bayes_auto_learn 1) before training the database?

Autolearning does the training for you. Manual training is still faster
and more precise.

> When autolearn fires on messages at the moment, it is correctly detecting
> ham and spam based on the default ham and spam thresholds:
>
>    bayes_auto_learn_threshold_nonspam 0.1
>    bayes_auto_learn_threshold_spam 12.0
>
> Can this be used before training the database, or is it more often used to
> supplement (on an ongoing basis) a database that has already been trained?

Those don't contradict each other.
You can use both manual and automatic learning.
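
As a sketch, assuming an otherwise stock local.cf, autolearning with the
thresholds you quoted would look like this (use_bayes and bayes_auto_learn
are already 1 by default, so they are listed only for clarity):

   use_bayes 1
   bayes_auto_learn 1
   bayes_auto_learn_threshold_nonspam 0.1
   bayes_auto_learn_threshold_spam 12.0

Manual training then runs alongside it, by feeding your collected mail to
sa-learn --spam and sa-learn --ham.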

--
Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
Chernobyl was an Windows 95 beta test site.
