On 11/30/07 12:57 PM, "Kevin Parris" <[EMAIL PROTECTED]> wrote:

> If I have followed the discussion correctly so far, the explanation for
> manual-learn not being distinguished from auto-learn is this:  no matter what
> mode of learning caused a token to appear in the database, if there is ongoing
> mail traffic that "hits" on the token then said token will not expire out
> anyway.
> 
> In other words, tokens don't expire because of where or how they came to be
> listed, they expire because no more incoming mail traffic references them.  If
> you manually train a message that is the ONLY instance of that particular spam
> to slip through your other filter, and your Bayes never sees another message
> that matches the tokens it generated, then those tokens are irrelevant
> regardless of learn mode.

That makes sense, except that if that type of message shows up infrequently,
and your token database turns over several times a day because of the high
volume of auto-learn...  If I've taken the time to send it in for a
manual-learn, I'd expect it to be remembered for a while, even if the
message only shows up every couple of days.

I guess the flip side is that if a message is manually learned, and then you
continue to get messages in like that (at least more than the turnover
frequency), then the manually-learned information should stay active.
Correct?

Wes


Reply via email to