Hi, all: I am using sa-learn to train my bayes filter. And I collect many known spams from our honey pot.
I found that there are so many mails with the same content in this spam corpus. Is it necessary to delete the repeated spams before sa-learn study? Thanks :) -- Xueron Nee <[EMAIL PROTECTED]>