I just figured out that the attachment exclusion thing doesn't work as desired so I removed everything pertaining to that (oops). The chance of a false positive occurring are very low even without the ability to exclude inline attachments that might contain raw scripting or HTML code.

Please use this updated file instead if you want to test out the filter. Also please post any FP's that you believe should be counterbalanced in the test like the Ticketmaster example.

Thanks,

Matt
# OBFUSCATION
# Last Update: 09/14/2003
#
# Description:
# Encoding of letters and numbers in E-mail is unnecessary, however various techniques 
are
# sometimes used by spammers to hide from filters, even mixing multiple techniques in 
URL's at
# times.  This filter will detect text and URL encoding only in combinations where 
multiple
# encoded numbers and characters are in succession or mixed with HTTP address 
components.  More
# information on URL obfuscation techniques can be found at: 
http://www.pc-help.org/obscure.htm
#
# Usage:
# OBFUSCATION     filter     C:\IMail\Declude\Obfuscation.txt     x     7     0
#
# False Positives:
# Web designers and programmers passing inline code, ASCII text art, and legitimate 
bulk mailers
# that needlessly URL encode letters and numbers in their script arguments (only 
special
# characters are necessary).  False positives are extremely rare.


# Counterbalances:
# Negative weighting is applied for responsible bulk mailers that fail this test.
#
# Test Exclusions:
# Ticketmaster.

MAILFROM        -7      ENDSWITH        ticketmaster.com


# URL Encoded Obfuscation:
# This technique is used to obfuscate URL's.  The filter will only match two 
characters in
# succession with the first being a letter or number in order to protect form false 
positives.
#
# Example:
# http://%77%77%77.%67%6F%6F%67%6C%65.%63%6F%6D/

# 0-9

BODY    0       CONTAINS        %30%
BODY    0       CONTAINS        %31%
BODY    0       CONTAINS        %32%
BODY    0       CONTAINS        %33%
BODY    0       CONTAINS        %34%
BODY    0       CONTAINS        %35%
BODY    0       CONTAINS        %36%
BODY    0       CONTAINS        %37%
BODY    0       CONTAINS        %38%
BODY    0       CONTAINS        %39%

# A-Z

BODY    0       CONTAINS        %41%
BODY    0       CONTAINS        %42%
BODY    0       CONTAINS        %43%
BODY    0       CONTAINS        %44%
BODY    0       CONTAINS        %45%
BODY    0       CONTAINS        %46%
BODY    0       CONTAINS        %47%
BODY    0       CONTAINS        %48%
BODY    0       CONTAINS        %49%
BODY    0       CONTAINS        %4a%
BODY    0       CONTAINS        %4b%
BODY    0       CONTAINS        %4c%
BODY    0       CONTAINS        %4d%
BODY    0       CONTAINS        %4e%
BODY    0       CONTAINS        %4f%
BODY    0       CONTAINS        %50%
BODY    0       CONTAINS        %51%
BODY    0       CONTAINS        %52%
BODY    0       CONTAINS        %53%
BODY    0       CONTAINS        %54%
BODY    0       CONTAINS        %55%
BODY    0       CONTAINS        %56%
BODY    0       CONTAINS        %57%
BODY    0       CONTAINS        %58%
BODY    0       CONTAINS        %59%
BODY    0       CONTAINS        %5a%

# a-z

BODY    0       CONTAINS        %61%
BODY    0       CONTAINS        %62%
BODY    0       CONTAINS        %63%
BODY    0       CONTAINS        %64%
BODY    0       CONTAINS        %65%
BODY    0       CONTAINS        %66%
BODY    0       CONTAINS        %67%
BODY    0       CONTAINS        %68%
BODY    0       CONTAINS        %69%
BODY    0       CONTAINS        %6a%
BODY    0       CONTAINS        %6b%
BODY    0       CONTAINS        %6c%
BODY    0       CONTAINS        %6d%
BODY    0       CONTAINS        %6e%
BODY    0       CONTAINS        %6f%
BODY    0       CONTAINS        %70%
BODY    0       CONTAINS        %71%
BODY    0       CONTAINS        %72%
BODY    0       CONTAINS        %73%
BODY    0       CONTAINS        %74%
BODY    0       CONTAINS        %75%
BODY    0       CONTAINS        %76%
BODY    0       CONTAINS        %77%
BODY    0       CONTAINS        %78%
BODY    0       CONTAINS        %79%
BODY    0       CONTAINS        %7a%

# With HTTP

BODY            0       CONTAINS        http://%
BODY            0       CONTAINS        [EMAIL PROTECTED]
BODY            0       CONTAINS        %.%


# HTML Encoded Obfuscation:
# This technique is used to obfuscate URL's and hide keywords.  The filter will only 
match
# two characters in succession with the first being a letter or number in order to 
protect
# form false positives.
#
# Examples:
# <A 
HREF="http://&#119;&#119;&#119;.&#103;&#111;&#111;&#103;&#108;&#101;.&#99;&#111;&#109;/";>Google</A>
# V&#73;AG&#82;A

# 0-9

BODY            0       CONTAINS        &#48;&#
BODY            0       CONTAINS        &#49;&#
BODY            0       CONTAINS        &#50;&#
BODY            0       CONTAINS        &#51;&#
BODY            0       CONTAINS        &#52;&#
BODY            0       CONTAINS        &#53;&#
BODY            0       CONTAINS        &#54;&#
BODY            0       CONTAINS        &#55;&#
BODY            0       CONTAINS        &#56;&#
BODY            0       CONTAINS        &#57;&#

# A-Z

BODY            0       CONTAINS        &#65;&#
BODY            0       CONTAINS        &#66;&#
BODY            0       CONTAINS        &#67;&#
BODY            0       CONTAINS        &#68;&#
BODY            0       CONTAINS        &#69;&#
BODY            0       CONTAINS        &#70;&#
BODY            0       CONTAINS        &#71;&#
BODY            0       CONTAINS        &#72;&#
BODY            0       CONTAINS        &#73;&#
BODY            0       CONTAINS        &#74;&#
BODY            0       CONTAINS        &#75;&#
BODY            0       CONTAINS        &#76;&#
BODY            0       CONTAINS        &#77;&#
BODY            0       CONTAINS        &#78;&#
BODY            0       CONTAINS        &#79;&#
BODY            0       CONTAINS        &#80;&#
BODY            0       CONTAINS        &#81;&#
BODY            0       CONTAINS        &#82;&#
BODY            0       CONTAINS        &#83;&#
BODY            0       CONTAINS        &#84;&#
BODY            0       CONTAINS        &#85;&#
BODY            0       CONTAINS        &#86;&#
BODY            0       CONTAINS        &#87;&#
BODY            0       CONTAINS        &#88;&#
BODY            0       CONTAINS        &#89;&#
BODY            0       CONTAINS        &#90;&#

# a-z

BODY            0       CONTAINS        &#97;&#
BODY            0       CONTAINS        &#98;&#
BODY            0       CONTAINS        &#99;&#
BODY            0       CONTAINS        &#100;&#
BODY            0       CONTAINS        &#101;&#
BODY            0       CONTAINS        &#102;&#
BODY            0       CONTAINS        &#103;&#
BODY            0       CONTAINS        &#104;&#
BODY            0       CONTAINS        &#105;&#
BODY            0       CONTAINS        &#106;&#
BODY            0       CONTAINS        &#107;&#
BODY            0       CONTAINS        &#108;&#
BODY            0       CONTAINS        &#109;&#
BODY            0       CONTAINS        &#110;&#
BODY            0       CONTAINS        &#111;&#
BODY            0       CONTAINS        &#112;&#
BODY            0       CONTAINS        &#113;&#
BODY            0       CONTAINS        &#114;&#
BODY            0       CONTAINS        &#115;&#
BODY            0       CONTAINS        &#116;&#
BODY            0       CONTAINS        &#117;&#
BODY            0       CONTAINS        &#118;&#
BODY            0       CONTAINS        &#119;&#
BODY            0       CONTAINS        &#120;&#
BODY            0       CONTAINS        &#121;&#
BODY            0       CONTAINS        &#122;&#

# With HTTP

BODY            0       CONTAINS        http://&#
BODY            0       CONTAINS        ;@&#
BODY            0       CONTAINS        ;.&#


# Combination URL and HTML Encoded Obfuscation:
# This technique is used to obfuscate URL's.  The filter will match two differently 
encoded
# characters in succession or separated by HTTP address components.
#
# Example:
# <A HREF="http://%77&#119;%77.&#103;%6F&#111;%67&#108;%65.&#99;%6F&#109;/";>Google</A>

BODY            0       CONTAINS        %&#
BODY            0       CONTAINS        %@&#
BODY            0       CONTAINS        ;@%
BODY            0       CONTAINS        %.&#
BODY            0       CONTAINS        ;.%


# Hexadecimal IP Obfuscation
# This technique is used to obfuscate IP addresses.  The filter will only match a 
hexadecimal
# number found immediately following one of three possible HTTP address components.  
Commented
# out due to a lack of current use in spam.
#
# Example:
# http://0xd8.0xef.0x25.0x64/

#BODY   0       CONTAINS        http://0x0
#BODY   0       CONTAINS        http://0x1
#BODY   0       CONTAINS        http://0x2
#BODY   0       CONTAINS        http://0x3
#BODY   0       CONTAINS        http://0x4
#BODY   0       CONTAINS        http://0x5
#BODY   0       CONTAINS        http://0x6
#BODY   0       CONTAINS        http://0x7
#BODY   0       CONTAINS        http://0x8
#BODY   0       CONTAINS        http://0x9
#BODY   0       CONTAINS        http://0xa
#BODY   0       CONTAINS        http://0xb
#BODY   0       CONTAINS        http://0xc
#BODY   0       CONTAINS        http://0xd
#BODY   0       CONTAINS        http://0xe
#BODY   0       CONTAINS        http://0xf
#BODY   0       CONTAINS        @0x0
#BODY   0       CONTAINS        @0x1
#BODY   0       CONTAINS        @0x2
#BODY   0       CONTAINS        @0x3
#BODY   0       CONTAINS        @0x4
#BODY   0       CONTAINS        @0x5
#BODY   0       CONTAINS        @0x6
#BODY   0       CONTAINS        @0x7
#BODY   0       CONTAINS        @0x8
#BODY   0       CONTAINS        @0x9
#BODY   0       CONTAINS        @0xa
#BODY   0       CONTAINS        @0xb
#BODY   0       CONTAINS        @0xc
#BODY   0       CONTAINS        @0xd
#BODY   0       CONTAINS        @0xe
#BODY   0       CONTAINS        @0xf
#BODY   0       CONTAINS        .0x0
#BODY   0       CONTAINS        .0x1
#BODY   0       CONTAINS        .0x2
#BODY   0       CONTAINS        .0x3
#BODY   0       CONTAINS        .0x4
#BODY   0       CONTAINS        .0x5
#BODY   0       CONTAINS        .0x6
#BODY   0       CONTAINS        .0x7
#BODY   0       CONTAINS        .0x8
#BODY   0       CONTAINS        .0x9
#BODY   0       CONTAINS        .0xa
#BODY   0       CONTAINS        .0xb
#BODY   0       CONTAINS        .0xc
#BODY   0       CONTAINS        .0xd
#BODY   0       CONTAINS        .0xe
#BODY   0       CONTAINS        .0xf


# Octal IP Obfuscation:
# This technique is used to obfuscate IP addresses.  Due to the possibility of false 
positives,
# the filter will only match an HTTP address component followed by two zeros which 
indicates a
# high probability of an octal number, though only one zero is required for proper 
encoding.  
# Commented out due to a lack of current use in spam.
#
# Example:
# http://0330.000357.0063.00000144/

#BODY   0       CONTAINS        http://00
#BODY   0       CONTAINS        @00

Reply via email to