On Sun, May 26, 2002 at 10:00:32PM -0400, Duncan Findlay wrote: > On Sun, May 26, 2002 at 10:55:34PM +0200, Tony L. Svanstrom wrote: > > I think so. Rules designed to catch spam, scored negatively, even if > they occur more frequently in non-spam than spam, are NOT good > indicators of spam. They are merely bad/false indicators of spam, and > the regexp's should be changes to make them better spam indicators. > > If we want to have negative scoring rules, we should try to put > together regexp's that are actually non-spam indicators. The > DEAR_SOMEBODY rules is a perfect example. "Dear Sir/Madam" is a sign > of spam, "Dear Duncan" is not. I think we should add: >
I realise, I'm rather North American-biased. But certainly stuff like "Dear IT Professional," is spam. What about "Dear Sir/Madam?" Is this common in business e-mail? Ususally, I would imagine, you know the gender of your correspondant. Maybe break this into 2 rules, instead of the following: > body DEAR_SIR /Dear (?:Sir|Madam|IT\b|friend\W|Internet)/i > describe DEAR_SIR How dear am I? You don't know my name! use: body DEAR_SIR_MADAM /Dear (?:Sir|Madam)/i describe DEAR_SIR_MADAM Contains Dear Sir or Madam. body DEAR_FRIEND /Dear (?:IT\b|friend\W|Internet)/i describe DEAR_FRIEND How dear am I? You don't know my name! and include: > body DEAR_EMAIL /Dear [A-Za-z0-9_-]+\@/ > describe DEAR_EMAIL How dear am I? You call me by my e-mail address! However, if DEAR_SIR_MADAM is negatively scored, I'd vote it be scrapped. Spammers will simply start beginning their mails with "Dear Sir/Madam," (I just can't wait till I see that greeting on a PENIS_ENLARGE e-mail!) -- Duncan Findlay _______________________________________________________________ Don't miss the 2002 Sprint PCS Application Developer's Conference August 25-28 in Las Vegas -- http://devcon.sprintpcs.com/adp/index.cfm _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk