Re: [SAtalk] Can base64 encoded messages learnable by sa-learn?

Matt Tencati Thu, 04 Sep 2003 15:33:28 -0700

I've been looking into the same situation actually.  I haven't gotten far enough to try
yet because of different things I've seen.  Domino changes the headers around somewhat,
especially the X-headers - converting them to X_ headers.  I'm not sure if it even 
keeps
them on a forwarded message (since I had a user forward me a false positive and I had 
no
X_SPAM_* headers from SA).


I'm looking into a method to archive (or log) all messages on the Linux box prior to
delivery to Domino.  Anyone have an easy way to do that either with SA or sendmail?  I
thought I had seen a document somewhere allowing mail to be delivered to a SPAM or HAM
mailbox as well as the original recipients but can't find it now.

Thanks,

Matt Tencati
SAES Pure Gas, Inc.
http://www.puregastechnologies.com



                                                                                       
                                                                             
                    Thomas Cameron                                                     
                                                                             
                    <[EMAIL PROTECTED]>          To:     [EMAIL PROTECTED]             
                                          
                    Sent by:                                  cc:                      
                                                                             
                    [EMAIL PROTECTED]       Subject:     Re: [SAtalk] Can base64 
encoded messages learnable by sa-learn?                          
                    eforge.net                                                         
                                                                             
                                                                                       
                                                                             
                                                                                       
                                                                             
                    09/04/2003 01:44 PM                                                
                                                                             
                                                                                       
                                                                             
                                                                                       
                                                                             




On Thu, 2003-09-04 at 13:47, Matt Kettler wrote:
> At 10:59 AM 9/4/2003 -0500, Thomas Cameron wrote:
> >All -
> >
> >I have a client with a Red hat 9 + SA 2.55 + spamass-milter server in
> >front of a Lotus Notes server.  The client has hired a Notes developer
> >to give their users an extra button in Notes to forward false negatives
> >to a spam account the Linux server so I can run sa-learn --spam on the
> >messages.  The thing is, the messages get sent as base64 encoded
> >attachments (see example below).
> >
> >Can sa-learn use this format to learn from?
>
> Well, bayes can learn from base 64 messages, but you absolutely should not
> use forwarded messages for bayes training.
>
> The problem is that the bayes engine winds up learning "anything that looks
> like it was forwarded via lotus notes is spam", which is clearly not the
> desired effect.
>
> For bayes to work the message fed to bayes must not be modified in any way
> from what it looks like as it comes in from the network. The bayes engine
> does also examine some headers, so those need to be the same as the
> originals as well, possibly with the exception of added Received: lines.
>
> Anything else is a recipe for a badly trained bayes database.
>
> See the spamassassin FAQ as well:
>
> http://spamassassin.taint.org/faq/index.cgi?req=show&file=faq05.003.htp
>
>
>
>
>
>
> -------------------------------------------------------
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> http://thinkgeek.com/sf
> _______________________________________________
> Spamassassin-talk mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

I agree that it's not the best choice to use a forwarded message, but
Notes apparently has no way to extract raw messages.

Believe me, I am *totally* open to any suggestions.  I believe that this
is the least evil way to do it, but I would love to be proven wrong.
--
Thomas Cameron, RHCE, CNE, MCSE, MCT
Cameron Technical Services, Inc.
http://www.camerontech.com/
(512) 454-3200



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk






-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Re: [SAtalk] Can base64 encoded messages learnable by sa-learn?

Reply via email to