Re: Spamassassin not capturing obvious Spam

Reindl Harald Tue, 31 May 2016 06:33:14 -0700


Am 31.05.2016 um 15:28 schrieb Antony Stone:

2. You should be aware (*especially* if using this stuff as the basis of a
research project - any competent referee should pick up on something like
this) that SA works best when the emails it is asked to process are from the
same source as it has been trained with.  In other words, you shovel real
emails through a real mail server and train SA using this spam and ham; you
then use that trains SA to assess mail passing through that same mail server,
for the same users.  Anything significantly varying from this is not going to
work well, and is certainly not a good test of how well SA works.

not true - i heard similar nonsense about "you can't re-use you MX bayes database on a submission server" - i can, do and it works like a charm

our current corpus is 90000 mails large, conatins samples in many languages for many users (site-wide setup) and that bayes is shared with another company for more than a year now and has the same results there as here (96% hit quote)

signature.asc
Description: OpenPGP digital signature

Re: Spamassassin not capturing obvious Spam

Reply via email to