On 1/15/2013 4:05 PM, Bowie Bailey wrote:
> On 1/15/2013 3:47 PM, Ben Johnson wrote:
>> One final question on this subject (sorry...).
>>
>> Is there value in training Bayes on messages that SA classified as spam
>> *due to other test scores*? In other words, if a message is classified
>> as SPAM due to a block-list test, but the message is new enough for
>> Bayes to assign a zero score, should that message be kept and fed to
>> sa-learn so that Bayes can soak-up all the tokens from a message that is
>> almost certainly spam (based on the other tests)?
>>
>> Am I making any sense?
> 
> It is always worthwhile to train Bayes.  In an ideal world, you would
> hand-sort and train every email that comes through your system.  The
> more mail Bayes sees the more accurate it can be.
> 

Thanks, Bowie. Given your response, would it then be prudent to call
"sa-learn --spam" on any message that *other tests* (non-Bayes tests)
determine to be spam (given some score threshold)?

The crux of my question/point is that I don't want to have to feed
messages that Bayes "misses" but that other tests identify *correctly*
as spam to "sa-learn --spam".

Is there value in implementing something like this? Or is there some
caveat that would make doing so self-defeating?

Thanks a bunch,

-Ben

Reply via email to