dman wrote:
> What is the charset GB2312?  Is it a windows thing?  Trying to display
> this spam message just gives me an odd sequence of high-order latin1
> chars.

It's an encoding for Chinese characters.

> Also, what is =?UNKNOWN?Q? in a header (eg Subject:)?  I know it's
> supposed to give the charset used in the header, but UNKNOWN?  You
> don't know what charset you used!?  Come on.

There are lots of broken mailers out there, sadly.

> Both of these (found in the same message just now) seem like good
> reasons to junk the message.  
> 
> I just discovered an error in the SUBJ_ALL_CAPS rule, I think.  If
> the subject is encoded in a dfferent charset than US-ASCII it could
> have "all caps" but really be other characters.  When that rule is
> evaluated, is it evaluated on the raw header or the decoded version?

Unfortunately it's done on the raw headers at the moment. Hopefully with 
the port to MIME::tools (or my custom mail parser) that will change, but 
I don't think MIME::tools supports that many character sets - just the 
ISO-8859 ones (the one I wrote supports anything that iconv supports on 
your system).

Matt.


_______________________________________________________________

Don't miss the 2002 Sprint PCS Application Developer's Conference
August 25-28 in Las Vegas -- http://devcon.sprintpcs.com/adp/index.cfm

_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to