-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

David B Funk writes:
>On Mon, 12 Jan 2004, Larry Starr wrote:
>
>> Just noticed a message with an encoded URL, that misses, the "BIZ_TLD" rule,
>> etc.
>>
>> The message body contains:
>> <a href=3d"http://gf=2eclearmath=2ebiz/jsimp/index=2ehtml";><font
>> face=3d"arial">scored </font>this way=2e
>>   <br><img src=3d"http://K=2eclearmath=2ebiz/images/js02=2ejpg"; border=3d=
>> "0">
>> </a>
>>
>> I know this wraps a bit ugly, when pasted into my mailer but, as you can see,
>> the punctuation, in the URI, is all hex encoded. "=2e", instead of ".".
>>
>> I have a local rule, in the form of bigevil.cf, with the following
>> sub-expression, that catches the above, but there has got to be a simpler way
>> to do this.
>>
>> uri  uri MyEvilList_001     ( /\b(?:=2e){0,1}clearmath(?:\.|=2e)biz)\b\i
>>
>> Does anyone know of a ruleset that handles this sort of thing, perhaps code
>> that decodes the "=xx" expressions prior to the "URI" matches?
>
>Actually that is a bastardized "quoted-printable" (QP) encoding of a URL.
>In QP the character sequence '=2E' is an encoded period, that spam-tool
>is generating '=2e' intending it to be interpreted as a period.
>
>SA is supposed to decode QP before running the various 'body' and 'uri'
>rules but there's a limitation in its decoding engine. If the QP
>encoding uses lower-case hex digits instead of CAPS hex digits, it
>does not recognize them as QP and fails to decode them.
>
>Strictly speaking RFC-2045 demands the usage of CAPS hex digits in
>QP (see section 6.7) and the lowercase stuff should be considered
>illegal.
>However many popular mail clients will decode the bastardized
>lowercase version and display the message to the user as the
>spammer intends (section 6.7, note (1) permits this).
>
>I can see two different ways to handle this, either make SA more
>flexible and decode the bastardized QP so normal rules will hit
>or write a rule that hits such bastardized QP coding as a spam-tool
>signature.

Are you sure about this?  If it's the case, we do need to
decode it, and it would be great to have it reported as a bug.

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFABLQ7QTcbUG5Y7woRAuDDAKDFMaby+i43rPzYiMOkXTuzjmpVXgCcD0G0
BgL6yBaSrlsHTwoiRVGE45c=
=sjgq
-----END PGP SIGNATURE-----



-------------------------------------------------------
This SF.net email is sponsored by: Perforce Software.
Perforce is the Fast Software Configuration Management System offering
advanced branching capabilities and atomic changes on 50+ platforms.
Free Eval! http://www.perforce.com/perforce/loadprog.html
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to