Re: Obfuscation (was: Millions and Billions)

2005-02-28 Thread Kenneth Porter
--On Sunday, February 27, 2005 7:46 PM -0800 Loren Wilton <[EMAIL PROTECTED]> wrote: He has a point. A complicated regex is complicated, and that can mean slow. It also by definition means "incomprehensible to humans", and so has to be generated by a tool, and then not touched or looked at. Than

Re: Obfuscation (was: Millions and Billions)

2005-02-28 Thread Dawn Keenan
Chris Santerre wrote: > I remember that paper. I was impressed and sceptical at the same time. I > could see it FPing a lot. One person in the crowd brought up Niagra vs. the > V-drug word :) > > Cialis vs. Dial-Lisa > ect.. That was MailFrontier, using the term lexigraphical distancing rat

RE: Obfuscation (was: Millions and Billions)

2005-02-28 Thread Chris Santerre
>-Original Message- >From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] >Sent: Monday, February 28, 2005 10:34 AM >To: Loren Wilton >Cc: users@spamassassin.apache.org >Subject: Re: Obfuscation (was: Millions and Billions) > > >-BEGIN PGP SIGNED MESSAGE-

Re: Obfuscation (was: Millions and Billions)

2005-02-28 Thread Justin Mason
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Loren Wilton writes: > Since a tool can generate the matching pattern and convert it to a re, it > seems that a tool could in theory generate a matching pattern and convert it > to something else that might be either more comprehensible or more > effi

Re: Obfuscation (was: Millions and Billions)

2005-02-28 Thread Robert Menschel
Hello Kenneth, Sunday, February 27, 2005, 7:35:18 AM, you wrote: KP> --On Thursday, February 24, 2005 6:07 PM -0500 Phil Barnett KP> <[EMAIL PROTECTED]> wrote: >> i or l = [|ííiil1] >> a = [EMAIL PROTECTED] >> e = [eé3] >> o = [o0] KP> It seems like this is getting overly-complicated. Are there

Re: Obfuscation (was: Millions and Billions)

2005-02-28 Thread Loren Wilton
> > I just question whether regex's are the right "complicated solution". > > > > How does Google or one of the dictionary sites guess the correct spelling > > for a misspelled word? > > Great, why don't you go see if google can guess the correct spelling for > > c l @ L i @ s He has a point. A c

Re: Obfuscation (was: Millions and Billions)

2005-02-27 Thread Phil Barnett
On Sunday 27 February 2005 06:31 pm, Kenneth Porter wrote: > --On Sunday, February 27, 2005 11:48 AM -0500 Phil Barnett <[EMAIL PROTECTED]> > > wrote: > > All you have requested here is for someone else to do the complicated > > stuff and make it easy for you. Someone has to get the code as comple

Re: Obfuscation (was: Millions and Billions)

2005-02-27 Thread Kenneth Porter
--On Sunday, February 27, 2005 11:48 AM -0500 Phil Barnett <[EMAIL PROTECTED]> wrote: All you have requested here is for someone else to do the complicated stuff and make it easy for you. Someone has to get the code as complex as it needs to be. If not you, then the guy that makes the library y

Re: Obfuscation (was: Millions and Billions)

2005-02-27 Thread Phil Barnett
On Sunday 27 February 2005 10:35 am, Kenneth Porter wrote: > --On Thursday, February 24, 2005 6:07 PM -0500 Phil Barnett > > <[EMAIL PROTECTED]> wrote: > > i or l = [|ííiil1] > > > > a = [EMAIL PROTECTED] > > > > e = [eé3] > > > > o = [o0] > > It seems like this is getting overly-complicated. Are t

Obfuscation (was: Millions and Billions)

2005-02-27 Thread Kenneth Porter
--On Thursday, February 24, 2005 6:07 PM -0500 Phil Barnett <[EMAIL PROTECTED]> wrote: i or l = [|ííiil1] a = [EMAIL PROTECTED] e = [eé3] o = [o0] It seems like this is getting overly-complicated. Are there any libraries for doing fuzzy string matching and obfuscation detection that could be us

Re: Millions and Billions

2005-02-25 Thread Duncan Hill
On Thursday 24 February 2005 21:54, Stuart Johnston wrote: > I've been seeing a ton of stock spam this week, no URLs - no SURBL :( > Bayes and Razor, etc pick up on it eventually but to speed things up, I > wrote a rule. One thing that is unique about these messages is that > they replace l's with

Re: Millions and Billions

2005-02-24 Thread Phil Barnett
On Thursday 24 February 2005 05:42 pm, [EMAIL PROTECTED] wrote: > Stuart Johnston wrote: > > [EMAIL PROTECTED] wrote: > >> Stuart Johnston wrote: > >>> body L_MILLBILL /[mb]i(?:\|l|l\||\|\|)ions?/i > >> > >> body L_MILLBILL /[mb]i[l|][l|]ions?/i > > > > I started with something similar to that but

Re: Millions and Billions

2005-02-24 Thread Chris Thielen
Hi, [EMAIL PROTECTED] wrote: Stuart Johnston wrote: [EMAIL PROTECTED] wrote: Stuart Johnston wrote: body L_MILLBILL /[mb]i(?:\|l|l\||\|\|)ions?/i body L_MILLBILL /[mb]i[l|][l|]ions?/i I started with something similar to that but it will also match millions which we do

Re: Millions and Billions

2005-02-24 Thread David B Funk
On Thu, 24 Feb 2005, Stuart Johnston wrote: > [EMAIL PROTECTED] wrote: [snip..] > > > > How about (slightly easier to read) > > body L_MILLBILL /[mb]i[l|][l|]ions?/i > > or even > > body L_MILLBILL /[mb]i[l|]{2}ions?/i > > I started with something similar to that but it will also match millions >

RE: Millions and Billions

2005-02-24 Thread Matthew.van.Eerde
Stuart Johnston wrote: > [EMAIL PROTECTED] wrote: >> Stuart Johnston wrote: >> >>> body L_MILLBILL /[mb]i(?:\|l|l\||\|\|)ions?/i >> body L_MILLBILL /[mb]i[l|][l|]ions?/i > I started with something similar to that but it will also match > millions which we don't want. Touché! OK, how about body L

Re: Millions and Billions

2005-02-24 Thread Stuart Johnston
[EMAIL PROTECTED] wrote: Stuart Johnston wrote: body L_MILLBILL /[mb]i(?:\|l|l\||\|\|)ions?/i Feel free to use it, make suggestions or point out that I wasted my time writing a rule already available from SARE. Stuart Johnston How about (slightly easier to read) body L_MILLBILL /[mb]i[l|][l|]ions?

RE: Millions and Billions

2005-02-24 Thread Matthew.van.Eerde
Stuart Johnston wrote: > body L_MILLBILL /[mb]i(?:\|l|l\||\|\|)ions?/i > > Feel free to use it, make suggestions or point out that I wasted my > time writing a rule already available from SARE. > > Stuart Johnston How about (slightly easier to read) body L_MILLBILL /[mb]i[l|][l|]ions?/i or even

Millions and Billions

2005-02-24 Thread Stuart Johnston
I've been seeing a ton of stock spam this week, no URLs - no SURBL :( Bayes and Razor, etc pick up on it eventually but to speed things up, I wrote a rule. One thing that is unique about these messages is that they replace l's with |'s. They usually will have some variation on Mil|ions or Bi|