On Sat, Jun 01, 2002 at 10:45:58AM -0700, Craig R Hughes wrote:
| dman wrote:
| 
| d> Also keep a way to test the raw text itself, since some junk is easily
| d> identified by the raw format and not so obvious after it has been
| d> prepared for display.
| 
| Well, we can have the charset translation stuff happen between
| rawbody and body -- so rawbody will have the original untrsanslated
| stuff, and body tests will have the processed stuff.  Would that
| work, or do we need to have an extra test stage, beyond the
| "charset_ok" stuff?

I guess that will work.  I'm just trying to think ahead, and some of
the tests (like malformed content, <iframe, or other encoded stuff)
might be easier to find before things are decoded.  Maybe have some
stages between "completely raw", "converted to UTF-8", "Q-P/base64
decoded".  The real test as to whether or not this is useful is to see
how easy it is to make rules at each stage.

-D

-- 

"...In the UNIX world, people tend to interpret `non-technical user' as
meaning someone who's only ever written one device driver."
    --Daniel Pead
 
GnuPG key : http://dman.ddts.net/~dman/public_key.gpg

Attachment: msg05970/pgp00000.pgp
Description: PGP signature

Reply via email to