I've been looking into the same situation actually. I haven't gotten far enough to try yet because of different things I've seen. Domino changes the headers around somewhat, especially the X-headers - converting them to X_ headers. I'm not sure if it even keeps them on a forwarded message (since I had a user forward me a false positive and I had no X_SPAM_* headers from SA).
I'm looking into a method to archive (or log) all messages on the Linux box prior to delivery to Domino. Anyone have an easy way to do that either with SA or sendmail? I thought I had seen a document somewhere allowing mail to be delivered to a SPAM or HAM mailbox as well as the original recipients but can't find it now. Thanks, Matt Tencati SAES Pure Gas, Inc. http://www.puregastechnologies.com Thomas Cameron <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Sent by: cc: [EMAIL PROTECTED] Subject: Re: [SAtalk] Can base64 encoded messages learnable by sa-learn? eforge.net 09/04/2003 01:44 PM On Thu, 2003-09-04 at 13:47, Matt Kettler wrote: > At 10:59 AM 9/4/2003 -0500, Thomas Cameron wrote: > >All - > > > >I have a client with a Red hat 9 + SA 2.55 + spamass-milter server in > >front of a Lotus Notes server. The client has hired a Notes developer > >to give their users an extra button in Notes to forward false negatives > >to a spam account the Linux server so I can run sa-learn --spam on the > >messages. The thing is, the messages get sent as base64 encoded > >attachments (see example below). > > > >Can sa-learn use this format to learn from? > > Well, bayes can learn from base 64 messages, but you absolutely should not > use forwarded messages for bayes training. > > The problem is that the bayes engine winds up learning "anything that looks > like it was forwarded via lotus notes is spam", which is clearly not the > desired effect. > > For bayes to work the message fed to bayes must not be modified in any way > from what it looks like as it comes in from the network. The bayes engine > does also examine some headers, so those need to be the same as the > originals as well, possibly with the exception of added Received: lines. > > Anything else is a recipe for a badly trained bayes database. > > See the spamassassin FAQ as well: > > http://spamassassin.taint.org/faq/index.cgi?req=show&file=faq05.003.htp > > > > > > > ------------------------------------------------------- > This sf.net email is sponsored by:ThinkGeek > Welcome to geek heaven. > http://thinkgeek.com/sf > _______________________________________________ > Spamassassin-talk mailing list > [EMAIL PROTECTED] > https://lists.sourceforge.net/lists/listinfo/spamassassin-talk I agree that it's not the best choice to use a forwarded message, but Notes apparently has no way to extract raw messages. Believe me, I am *totally* open to any suggestions. I believe that this is the least evil way to do it, but I would love to be proven wrong. -- Thomas Cameron, RHCE, CNE, MCSE, MCT Cameron Technical Services, Inc. http://www.camerontech.com/ (512) 454-3200 ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk ------------------------------------------------------- This sf.net email is sponsored by:ThinkGeek Welcome to geek heaven. http://thinkgeek.com/sf _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk