Re: re-learning ? was - bayes - large message

Joe Acquisto-j4 Sat, 20 Apr 2013 07:59:31 -0700

. . . 
>    Ok, I am officially puzzled.   
>    
>    I set up email addresses on my SA box, to which I and others (they say) 
> send ham/spam.  Then I have cron tasks that feed those emails to bayes twice 
> daily and email the output to my admin mailbox.
>    
>    I can review those admin messages and see "Learned tokens from n 
> message(s) (n message(s) examined)".   Yet, if I resend the bayes food from 
> those dates, it appears to re-learn them.   I would expect "Learned tokens 
> from 0 message(s) (n message(s) . . . " if it had already seen them.
>    
>    I have tried this for several dates and get the same result.  What could 
> it be?  Not Operator Trouble, surely . . .
>    
>    joe a
> 
> Bayes uses the message id from the email message to remember which
> messages it has seen.  If you are really emailing the messages then
> you are getting a new message-id which is then learned.  You need to
> train on the unadulterated original email message.  You can do this by
> attaching the complete email message.  Otherwise you are training
> bayes to recognize tokens added by your users during the forwarding
> process as a spam indicator.
> 
> -jeff


To attempt to answer several replies in one go:

I do email them to the Linux box that hosts SA.  Postfix is in use there.
The spam and ham accounts each have their own "mailbox" there, which appears 
to be essentially a file named for the account.

To send the samples, the user forwards each message as an attachment.  Each 
is an individual message to either ham or spam, with the (hopefully) correct 
attachment.

I think this meets the requirements?  But if it is reading the forwarder's 
info, that explains why it is "re-learning" and suggests my bayes data is 
worthless.
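
One way to take the forwarder's wrapper out of the picture would be to pull 
the attached original out of each forwarded message before it reaches bayes.  
Below is a minimal Python sketch, assuming the forwards arrive as MIME 
message/rfc822 attachments (which depends on the mail client) and using only 
the standard library email package.  The function names are made up for 
illustration; nothing here is part of SA.

```python
from email import message_from_bytes


def _collect(part, found):
    """Append the raw bytes of each attached message found under `part`."""
    if part.get_content_type() == "message/rfc822":
        # The parser stores an attached message as a one-element list.
        payload = part.get_payload()
        if isinstance(payload, list) and payload:
            found.append(payload[0].as_bytes())
        # Deliberately do not descend into the original itself.
    elif part.is_multipart():
        for sub in part.get_payload():
            _collect(sub, found)


def extract_attached_originals(wrapper_bytes):
    """Return the attached original message(s) from one forwarded wrapper.

    Hypothetical helper: assumes the user forwarded "as attachment", so the
    original sits in a message/rfc822 part with its own headers intact.
    """
    wrapper = message_from_bytes(wrapper_bytes)
    found = []
    _collect(wrapper, found)
    return found
```

Each returned item could then be written to a file and given to sa-learn 
--spam or sa-learn --ham in place of the wrapper.  The output is re-serialized 
by the email package, so it may not be byte-for-byte what the user received, 
but the original headers, including Message-ID, are carried over rather than 
the forwarder's.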

joe a.
