On 09/23/12 18:28, John Hardin wrote:
> On Sun, 23 Sep 2012, James wrote:
> 
>> I wrote this little script to update the bayes rules. I can do this on my 
>> imap account but my pop3 account gets way more spam and the messages are no 
>> longer on the machine with sa once I pop them off.
>>
>> Any comments on my script?
> 
> Bear in mind that spams which don't score high enough to be quarantined or 
> discarded will end up in your inbox, as will false negatives. Training all of 
> the mail in all of your inboxes as ham will train these messages as ham and 
> make any small error in classification much worse over time.
I will manually move spam to a Junk folder (not a .INBOX* folder).
> 
> During the initial training period you want to manually review messages and 
> build a ham corpus and a spam corpus. Once bayes is running you generally 
> only want to train on misclassified messages. This decision-making process 
> cannot be automated, or the errors wouldn't occur in the first place.
> 
> You should set up per-user train-as-ham and train-as-spam mailboxes, and only 
> train from those, only for the users whose judgement you trust. Then, those 
> users should copy misclassified messages to the appropriate folder and may 
> also add samples of ham to the train-as-ham folder whenever desired.

The only user is me. :-)
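
For a single user, I take the advice above to mean something like the rough
sketch below. The folder names .train-ham and .train-spam, the MAILDIR
default, and the SA_LEARN override are my own placeholders, not anything
SpamAssassin defines; only the --ham and --spam options come from sa-learn
itself.

```shell
#!/bin/bash
# Sketch: train Bayes only from two hand-curated folders, never from the inbox.
# Assumed (hypothetical) folder names: .train-ham and .train-spam.
MAILDIR="${MAILDIR:-$HOME/Maildir}"
SA_LEARN="${SA_LEARN:-sa-learn}"   # set SA_LEARN=echo for a dry run

# train <ham|spam> <folder>: learn one folder, skipping it if it is absent.
train() {
    local kind="$1" folder="$2"
    if [ ! -d "$folder" ]; then
        echo "Skipping missing folder: $folder"
        return 0
    fi
    echo "Training $kind from $folder"
    "$SA_LEARN" "--$kind" "$folder"
}

train ham  "$MAILDIR/.train-ham"
train spam "$MAILDIR/.train-spam"
```

Running it with SA_LEARN=echo prints the sa-learn arguments instead of
training, which is a cheap way to check the folder paths first. Add sudo in
front of the command if your setup needs it, as in my original script.
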
Is there a way to convert my Thunderbird bayes data to SpamAssassin?
> 
>>
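>> #!/bin/bash
>>
>> # Split the find output on newlines only, so folder names with spaces survive.
>> IFS=$'\n'
>> FOLDERLIST=$(find Maildir -name '.INBOX*' -type d)
>>
>> # Train every .INBOX* folder as ham (command left commented out, as posted).
>> for i in $FOLDERLIST; do
>>    echo "Processing $i"
>> #    sudo sa-learn --ham "$i"
>> done
>>
>> # Train the Junk folder as spam (also left commented out).
>> #sudo sa-learn --spam Maildir/.Junk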
>>
> 
> 
