Re: [SAtalk] A pointer for nailing Korean based spam

Derrick 'dman' Hudson Thu, 06 Jun 2002 05:34:41 -0700

On Sat, Jun 01, 2002 at 10:45:58AM -0700, Craig R Hughes wrote:
| dman wrote:
| 
| d> Also keep a way to test the raw text itself, since some junk is easily
| d> identified by the raw format and not so obvious after it has been
| d> prepared for display.
| 
| Well, we can have the charset translation stuff happen between
| rawbody and body -- so rawbody will have the original untrsanslated
| stuff, and body tests will have the processed stuff.  Would that
| work, or do we need to have an extra test stage, beyond the
| "charset_ok" stuff?


I guess that will work.  I'm just trying to think ahead, and some of
the tests (like malformed content, <iframe, or other encoded stuff)
might be easier to find before things are decoded.  Maybe have some
stages between "completely raw", "converted to UTF-8", "Q-P/base64
decoded".  The real test as to whether or not this is useful is to see
how easy it is to make rules at each stage.

-D

-- 

"...In the UNIX world, people tend to interpret `non-technical user' as
meaning someone who's only ever written one device driver."
    --Daniel Pead
 
GnuPG key : http://dman.ddts.net/~dman/public_key.gpg

msg05970/pgp00000.pgp
Description: PGP signature

Re: [SAtalk] A pointer for nailing Korean based spam

Reply via email to