Ah. Jo, sounds like your rule will need a little more work in that case. ;)
--j. Theo Van Dinter writes: > FWIW, I responded a few days ago with an explanation of why the rule isn't > hitting. It has nothing to do with content-type headers and everything to do > with the fact that the message body isn't empty, there's HTML content. > > > On Thu, Aug 16, 2007 at 10:15:03AM +0100, Justin Mason wrote: > > > > Jo -- > > > > I've checked that in as 'TVD_PDF_FINGER01_JO'. You can track its progress > > at http://ruleqa.SpamAssassin.org . > > > > by the way -- it's pretty easy for you to test your own rules in your own > > environment, actually, and I recommend you try it out. These are the > > tools we use: > > > > http://wiki.apache.org/spamassassin/MassCheck > > http://wiki.apache.org/spamassassin/HitFrequencies > > > > They are bundled with SpamAssassin in the "masses" folder. All the > > documentation is there on the wiki. > > > > --j. > > > > Jo Rhett writes: > > > Since nobody is paying attention, let me clarify. The current rule is > > > wrong: > > > > > > mimeheader __TVD_MIME_ATT_AP Content-Type =~ /^application\/pdf/i > > > mimeheader __TVD_MIME_ATT_AOPDF Content-Type =~ > > > /^application\/octet-stream.*\.pdf/i > > > > > > meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && __TVD_MIME_ATT_TP && > > > __TVD_MIME_ATT && !__TVD_BODY > > > > > > This evaluates to exactly the same as this: > > > > > > meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && __TVD_MIME_ATT_TP && > > > !__TVD_BODY > > > > > > I believe that the original rule's intent was this: > > > > > > meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && __TVD_MIME_ATT && !__TVD_BODY > > > > > > Can someone with commit rights please test and commit this change? > > > Thank you. > > > > > > Jo Rhett wrote: > > > > Well actually I think the rule has a bug. Why OR the two mime types as > > > > a new meta, and then require one of the two in the final meta? The > > > > net > > > > effect is that if ATT_TP is true it matches, but if ATT_AOPDF is true > > > > it > > > > will never match. > > > > > > > > I believe that the following will work better - work in every situation > > > > that it worked before, and not fail when the mime type is octet-stream: > > > > meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && __TVD_MIME_ATT > > > > && > > > > !__TVD_BODY > > > > > > > > Would someone kindly evaluate this change and possibly fix the rule? > > > > Thanks. > > > > > > > > On Aug 14, 2007, at 10:41 PM, Loren Wilton wrote: > > > >>>> rawbody __TVD_BODY /\S{4}/ > > > >> > > > >> true > > > >> > > > >>>> header __TVD_MIME_CT_MM Content-Type =~ /^multipart\/mixed/i > > > >> > > > >> true > > > >> > > > >>>> mimeheader __TVD_MIME_ATT_AP Content-Type =~ /^application\/pdf/i > > > >> > > > >> false > > > >> > > > >>>> mimeheader __TVD_MIME_ATT_AOPDF Content-Type =~ > > > >>>> /^application\/octet-stream.*\.pdf/i > > > >> > > > >> maybe true, maybe not. I would hope newlines were translated to > > > >> spaces by the mimehdr plugin, but maybe they weren't. Try /is instead > > > >> of /i and see if it helps. > > > >> > > > >>>> meta __TVD_MIME_ATT __TVD_MIME_ATT_AP || > > > >>>> __TVD_MIME_ATT_AOPDF > > > >> > > > >> maybe true > > > >> > > > >>>> meta TVD_PDF_FINGER01 > > > >> __TVD_MIME_CT_MM > > > >> true > > > >> && __TVD_MIME_ATT_TP > > > >> undefined here, can't say > > > >> && __TVD_MIME_ATT > > > >> maybe true > > > >> && !__TVD_BODY > > > >> true > > > >> > > > >> So, not knowing what is in __TVD_MIME_ATT_TP, I haven't a clue if it > > > >> will fire, since that is part of an 'and'. If I assume it to be true > > > >> then I'm still not sure because of the multiline possibility in > > > >> __TVD_MIME_ATT. > > > >> > > > >> Loren > > > >> > > > >>>> describe TVD_PDF_FINGER01 Mail matches standard pdf spam > > > >>>> fingerprint > > > >> > > > >> > > > >> ----- Original Message ----- From: "Jo Rhett" <[EMAIL PROTECTED]> > > > >> To: "SpamAssassin Users" <users@spamassassin.apache.org> > > > >> Sent: Tuesday, August 14, 2007 10:16 PM > > > >> Subject: Re: PDF rule not matching -- split line content type? > > > >> > > > >> > > > >>> Can someone clue me in on why this rule isn't matching? > > > >>> > > > >>> Jo Rhett wrote: > > > >>>> So I've been getting a metric ton of PDF spam. Investigating the > > > >>>> rule that is supposed to match this, I see > > > >>>> > > > >>>> rawbody __TVD_BODY /\S{4}/ > > > >>>> header __TVD_MIME_CT_MM Content-Type =~ /^multipart\/mixed/i > > > >>>> meta __TVD_MIME_ATT __TVD_MIME_ATT_AP || > > > >>>> __TVD_MIME_ATT_AOPDF > > > >>>> meta TVD_PDF_FINGER01 __TVD_MIME_CT_MM && > > > >>>> __TVD_MIME_ATT_TP && __TVD_MIME_ATT && !__TVD_BODY > > > >>>> describe TVD_PDF_FINGER01 Mail matches standard pdf spam > > > >>>> fingerprint > > > >>>> > > > >>>> mimeheader __TVD_MIME_ATT_AP Content-Type =~ /^application\/pdf/i > > > >>>> mimeheader __TVD_MIME_ATT_AOPDF Content-Type =~ > > > >>>> /^application\/octet-stream.*\.pdf/i > > > >>>> > > > >>>> The following message appears to match perfectly with this, except > > > >>>> for perhaps that the content type is spread across two lines? I > > > >>>> haven't checked the code, but would this matter? > > > >>>> > > > >>>> Return-Path: <[EMAIL PROTECTED]> > > > >>>> Received: from mail.netconsonance.com ([unix socket]) > > > >>>> by triceratops.netconsonance.com (Cyrus v2.3.8) with LMTPA; > > > >>>> Tue, 14 Aug 2007 06:27:16 -0700 > > > >>>> Received: from [84.21.29.58] ([84.21.29.58]) > > > >>>> by mail.netconsonance.com (8.14.1/8.14.1) with ESMTP id > > > >>>> l7EDR4UU095951 > > > >>>> for <[EMAIL PROTECTED]>; Tue, 14 Aug 2007 06:27:08 -0700 (PDT) > > > >>>> (envelope-from [EMAIL PROTECTED]) > > > >>>> X-Virus-Scanned: amavisd-new at netconsonance.com > > > >>>> X-Spam-Score: 2.033 > > > >>>> X-Spam-Level: ** > > > >>>> X-Spam-Status: No, score=2.033 tagged_above=-999 required=4 > > > >>>> tests=[DK_POLICY_SIGNSOME=0.001, HTML_MESSAGE=0.001, > > > >>>> MIME_HTML_MOSTLY=0.699, RCVD_IN_BL_SPAMCOP_NET=1.332] > > > >>>> Received: from x-6of7ca27m39al ([158.187.61.7]) by [84.21.29.58] > > > >>>> with Microsoft SMTPSVC(6.0.3790.1830); > > > >>>> Tue, 14 Aug 2007 15:27:01 +0200 > > > >>>> Message-ID: <[EMAIL PROTECTED]> > > > >>>> From: "Yohann michels" <[EMAIL PROTECTED]> > > > >>>> To: [EMAIL PROTECTED] > > > >>>> Subject: bill-jrhett > > > >>>> Date: Tue, 14 Aug 2007 15:26:28 +0200 > > > >>>> MIME-Version: 1.0 > > > >>>> Content-Type: multipart/mixed; > > > >>>> boundary="----=_NextPart_000_000E_01C7DE87.7C1E24D0" > > > >>>> X-Priority: 3 > > > >>>> X-MSMail-Priority: Normal > > > >>>> X-Mailer: Microsoft Outlook Express 6.00.2900.3138 > > > >>>> X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3138 > > > >>>> > > > >>>> > > > >>>> ------=_NextPart_000_000E_01C7DE87.7C1E24D0 > > > >>>> Content-Type: multipart/alternative; > > > >>>> boundary="----=_NextPart_001_000F_01C7DE87.7C1E24D0" > > > >>>> > > > >>>> > > > >>>> ------=_NextPart_001_000F_01C7DE87.7C1E24D0 > > > >>>> Content-Transfer-Encoding: quoted-printable > > > >>>> Content-Type: text/plain; > > > >>>> charset=windows-1250 > > > >>>> > > > >>>> > > > >>>> ------=_NextPart_001_000F_01C7DE87.7C1E24D0 > > > >>>> Content-Transfer-Encoding: quoted-printable > > > >>>> Content-Type: text/html; > > > >>>> charset=windows-1250 > > > >>>> > > > >>>> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> > > > >>>> <HTML><HEAD> > > > >>>> <META http-equiv=3DContent-Type content=3D"text/html; = > > > >>>> charset=3Dwindows-1250"> > > > >>>> <META content=3D"MSHTML 6.00.2900.3132" name=3DGENERATOR> > > > >>>> <STYLE></STYLE> > > > >>>> </HEAD> > > > >>>> <BODY bgColor=3D#ffffff> > > > >>>> <DIV><FONT face=3DArial size=3D2></FONT> </DIV></BODY></HTML> > > > >>>> > > > >>>> ------=_NextPart_001_000F_01C7DE87.7C1E24D0-- > > > >>>> > > > >>>> ------=_NextPart_000_000E_01C7DE87.7C1E24D0 > > > >>>> Content-Transfer-Encoding: base64 > > > >>>> Content-Type: application/octet-stream; > > > >>>> name=marketing-jrhett.pdf > > > >>>> Content-Disposition: attachment; > > > >>>> filename=marketing-jrhett.pdf > > > >>>> > > > >>>> JVBERi0xLjUNJeLjz9MNCjIyIDAgb2JqPDwvSFs0MzYgMTQ4XS9MaW5lYXJpemVkIDEvRSAxNjU5 > > > >>>> > > > >>>> L0wgMTM1NzYvTiAxMC9PIDI2L1QgMTMwNzQ+Pg1lbmRvYmoNICAgICAgICAgICAgICAgICAgICAg > > > >>>> > > > >>>> *snip* > > > >>>> > > > >>>> > > > >>> > > > >>> > > > >>> -- > > > >>> Jo Rhett > > > >>> Net Consonance ... net philanthropy, open source and other randomness > > > >> > > > >> > > > > > > > > > > > > > -- > > > Jo Rhett > > > Net Consonance ... net philanthropy, open source and other randomness > > -- > Randomly Selected Tagline: > "Premature optimisation is the root of all evil." - Knuth