Hello Steve, Matt,

Wednesday, January 5, 2005, 10:27:32 AM, you wrote:

MK> At 12:26 PM 1/5/2005, Steve Bondy wrote:
>>Is it safe to write a rule that triggers on the content of the Message
>>ID header?

MK> Yes...

Agreed.

>>I frequently see things coming in which have message IDs that
>>include the domain name of the target mail system, rather than the
>>source mail system.  If I remember correctly, the message ID should
>>always be the source....

Also agreed with all the others -- this simple a rule does not work.

The SARE header rules files include several message-id rules that test
for patterns that are found in spam but not non-spam. Examples:

header    SARE_MSGID_LONG          MESSAGEID =~ /<.{135,}>/
describe  SARE_MSGID_LONG          Message ID is too long.
score     SARE_MSGID_LONG          1.183
#hist     SARE_MSGID_LONG          Jesse Houwing, August 20 2004
#max      SARE_MSGID_LONG          90s/0h of 70699 corpus (43133s/27566h RM) 
10/02/04
#counts   SARE_MSGID_LONG          16s/0h of 38748 corpus (15267s/23481h 
JH-SA3.0rc1) 08/19/04
#counts   SARE_MSGID_LONG          7s/0h of 34763 corpus (18647s/16116h MY) 
08/25/04

header    SARE_MSGID_SHORT         MESSAGEID =~ /^.{1,6}$/
describe  SARE_MSGID_SHORT         Message ID is too short to be valid. 
score     SARE_MSGID_SHORT         3.333
#stype    SARE_MSGID_SHORT         spamgg
#hist     SARE_MSGID_SHORT         RM_hm_ShortMsgid6
#counts   SARE_MSGID_SHORT         8s/0h of 68491 corpus (41115s/27376h RM) 
09/18/04
#max      SARE_MSGID_SHORT         191s/0h of 115925 corpus (94616s/21309h) 
05/01/04
#counts   SARE_MSGID_SHORT         24s/0h of 38398 corpus (14914s/23484h JH) 
08/14/04 TM2 SA3.0-pre2
#counts   SARE_MSGID_SHORT         68s/0h of 17050 corpus (14617s/2433h MY) 
08/08/04

If you find any other combinations that can be used to identify spam,
we're definitely interested in testing them.

Bob Menschel



Reply via email to