angstrom
Armstrong
Bergstrom
birthplace
birthplaces
bremsstrahlung
corkscrew
Dijkstra
downstream
hardscrabble
jockstrap
Knightsbridge
lengthly
lengths
lengthwise
Lindstrom
Longstreet
Nietzsche
nightclub
Nordstrom
offspring
postscript
postscripts
Rothschild
sportswriter
sportswriting
strengths
switchblade
wavelengths
witchcraft
worthwhile
worthwhileness

That what 5 hits.

With 6 only
Knightsbridge


The dictionary I'm going against is just the standard one that comes
with redhat.  (grep -E '[bcdfghjklmnpqrstvwxz]{6}'
/usr/share/dict/linux.words) A more complete dictionary would result in
more hits.  I definitely like the idea, though!

Mike



-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of
Chris Santerre
Sent: Thursday, November 06, 2003 2:24 PM
To: 'Greg Webster'; [EMAIL PROTECTED]
Subject: RE: [SAtalk] 'random' character sets


 There is an excellent set of rules being tested now. Just more tweaking
needed. Your set is different. I'll give them a go and see how it pans
out!

--Chris

> -----Original Message-----
> From: Greg Webster [mailto:[EMAIL PROTECTED]
> Sent: Thursday, November 06, 2003 1:59 PM
> To: [EMAIL PROTECTED]
> Subject: [SAtalk] 'random' character sets
> 
> 
> A thought on spammers oft-used sets of 'random' character lists in
> emails...an example:
> 
> --
> gnqplleqhzblll
> u
>  wfjmvfe upvxoi lwhm
> xqs 
> flckwrtsmufx irwajksqsnw er wcfjgfmk jugxfq
> --
> 
> Seems to me that some tests can be made from these...
> body 10_CONSONENTS /[bcdfghjklmnpqrstvwxz]{10}/
> score GW_10_CONSONENTS                1.0
> body 9_CONSONENTS /[bcdfghjklmnpqrstvwxz]{9}/
> score GW_9_CONSONENTS         0.9
> body 8_CONSONENTS /[bcdfghjklmnpqrstvwxz]{8}/
> score GW_8_CONSONENTS         0.8
> body 7_CONSONENTS /[bcdfghjklmnpqrstvwxz]{7}/
> score GW_7_CONSONENTS         0.7
> body 6_CONSONENTS /[bcdfghjklmnpqrstvwxz]{6}/
> score GW_6_CONSONENTS         0.6
> body 5_CONSONENTS /[bcdfghjklmnpqrstvwxz]{5}/
> score GW_5_CONSONENTS         0.5
> 
> These have not been tested yet...
> 
> Some potential concerns:
> - Encoded messages will likely set this off (uuencode, binhex, etc.)
> - Are there many legitimate situations where 5+ consonents 
> will be seen?
> - Will other languages (such as German and Welsh with long strings of
> consonents) be penalized for using this?
> - Can we determine any other sorts of patterns from spammers use of
> these?
> 
> Any more thoughts?
> 
> Greg
> 
> -- 
> Greg Webster - [EMAIL PROTECTED]
> In-Touch Software Corporation
> Ph: (604)278-0515 - Fax: (604)608-3112
> 
> 
> 
> -------------------------------------------------------
> This SF.net email is sponsored by: SF.net Giveback Program.
> Does SourceForge.net help you be more productive?  Does it
> help you create better code?   SHARE THE LOVE, and help us help
> YOU!  Click Here: http://sourceforge.net/donate/
> _______________________________________________
> Spamassassin-talk mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
> 


-------------------------------------------------------
This SF.net email is sponsored by: SF.net Giveback Program.
Does SourceForge.net help you be more productive?  Does it
help you create better code?   SHARE THE LOVE, and help us help
YOU!  Click Here: http://sourceforge.net/donate/
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk


-------------------------------------------------------
This SF.net email is sponsored by: SF.net Giveback Program.
Does SourceForge.net help you be more productive?  Does it
help you create better code?   SHARE THE LOVE, and help us help
YOU!  Click Here: http://sourceforge.net/donate/
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to