Training Bayes on outbound mail

Dominic Benson Fri, 28 Jan 2011 10:10:44 -0800

Hi -

Recently, in order to balance the ham/spam ratio given to sa-learn, I have started to pass mail submitted by authenticated users to sa-learn --ham. The thinking here is that users would generally want to receive mail that they send, and many messages will either be replies or replied to, so this is likely to have a fair amount in common with legitimate mail coming in.

The existing bayes training was from auto-learn, on 60k ham and 360k spam; since starting to do this, nearly twice as much ham as spam has been learned.

I haven't seen any mention of this strategy on-list or on the web, so I'm interested in whether (a) anyone else does this, and (b) there is a good reason not to do it that I haven't thought of.

The approach, if anyone is interested, is to use an "unseen" Exim router to pipe mail to sa-learn --ham using the pipe transport, on the condition that an acl_m variable, set for authenticated users in acl_check_rcpt, evaluates to true.
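
Roughly, that could look like the sketch below. This is illustrative rather than a copy of a running config: the variable name acl_m_learn_ham, the router and transport names, the sa-learn path and the user are all placeholders to adapt.

```
# ACL section, inside acl_check_rcpt, placed before the statement
# that accepts mail from authenticated senders.
# acl_m_learn_ham is a made-up name; any acl_m variable will do.
  warn    authenticated = *
          set acl_m_learn_ham = yes

# Routers section, ahead of the routers that do the real delivery.
# "unseen" lets the message carry on to the later routers after
# this one has accepted it, so normal delivery is unaffected.
learn_outbound_ham:
  driver = accept
  condition = ${if eq{$acl_m_learn_ham}{yes}}
  transport = sa_learn_ham
  unseen
  no_verify

# Transports section.
# Path and user are assumptions; the user must be able to write to
# the Bayes database that the scanner reads.
sa_learn_ham:
  driver = pipe
  command = /usr/bin/sa-learn --ham
  user = mail
  ignore_status = true
```

The ignore_status setting is there so that a failing sa-learn run does not affect delivery of the message itself; whether that is the right trade-off depends on how much you want to hear about training failures.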


Dominic
