Is testing for gibberish in email address worth attempting?

In some basic testing I took just the first part of the email address (the
left side of the @) and made a simple scoring system based on the
switching of the characters from one type to another (alpha, numeric,
other).

The first character is given free of course.  If the next character is a
different type 1 point is added.  Subsequent characters of the same type
do not increase the score.  Changes in case could be done as well -
thinking .5 for that score.

So just as an example the address.

3w2g66afv6

Gives

3       0
w       1
2       1
g       1
6       1
6       0
a       1
f       0
v       0
6       1

6 changes for 10 characters.

The best number range so far for detecting gibberish like this seems to be
right around the 1/3 to 1/2 of the score.  More testing would have to
be done to find the right number.

Could this be of use?  The affects would be small but I'd imagine every
little bit helps.

Jason Portwood
[EMAIL PROTECTED]






-------------------------------------------------------
This SF.net email is sponsored by:  Etnus, makers of TotalView, The best
thread debugger on the planet. Designed with thread debugging features
you've never dreamed of, try TotalView 6 free at www.etnus.com.
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to