At 04:34 AM 1/9/2005 +0700, you wrote:
Hi all,
Greetings. I've just joined the list.

I've been using sa-learn with SA 2.64 and 3.0.2.
One thing is bugging me though. Is it safe to train SA on a very long spam,
such as the stock report spam? Will it cause many false positives?

Why would you think it would?

By avoiding training on that message, you're skewing your Bayes database toward false negatives.

Train spam as spam, train ham as ham. Let the statistics deal with the overlap. If you avoid training on "spammish" ham or "hammish" spam, you make the training set unrealistic, and that only does your training a disservice.
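
To illustrate how the statistics can absorb the overlap, here is a toy sketch in Python. It is not SpamAssassin's actual Bayes implementation; the function names (train, token_spam_prob), the sample messages, and the simple ratio formula are all assumptions made up for this example.

```python
from collections import Counter

def train(messages):
    """Count, per token, how many messages contain it (hypothetical helper)."""
    counts = Counter()
    for msg in messages:
        counts.update(set(msg.lower().split()))
    return counts

def token_spam_prob(token, spam_counts, ham_counts, n_spam, n_ham):
    """Toy estimate of how spammy a token is; 0.5 means neutral."""
    s = spam_counts[token] / n_spam if n_spam else 0.0
    h = ham_counts[token] / n_ham if n_ham else 0.0
    if s + h == 0:
        return 0.5  # unseen token: no evidence either way
    return s / (s + h)

# Made-up training data: the spam and the ham share some vocabulary.
spam = ["hot stock report buy shares now", "stock alert buy now"]
ham = ["quarterly stock report attached for review", "meeting notes attached"]

spam_counts = train(spam)
ham_counts = train(ham)

for tok in ("stock", "report", "buy", "attached"):
    p = token_spam_prob(tok, spam_counts, ham_counts, len(spam), len(ham))
    print(tok, round(p, 2))
```

In this toy model, a token such as "report" that appears equally often in spam and ham comes out neutral, while tokens seen only in spam ("buy") or only in ham ("attached") carry the weight. Leaving the long spam out of training would simply remove that evidence.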



