dman wrote: > What is the charset GB2312? Is it a windows thing? Trying to display > this spam message just gives me an odd sequence of high-order latin1 > chars.
It's an encoding for Chinese characters. > Also, what is =?UNKNOWN?Q? in a header (eg Subject:)? I know it's > supposed to give the charset used in the header, but UNKNOWN? You > don't know what charset you used!? Come on. There are lots of broken mailers out there, sadly. > Both of these (found in the same message just now) seem like good > reasons to junk the message. > > I just discovered an error in the SUBJ_ALL_CAPS rule, I think. If > the subject is encoded in a dfferent charset than US-ASCII it could > have "all caps" but really be other characters. When that rule is > evaluated, is it evaluated on the raw header or the decoded version? Unfortunately it's done on the raw headers at the moment. Hopefully with the port to MIME::tools (or my custom mail parser) that will change, but I don't think MIME::tools supports that many character sets - just the ISO-8859 ones (the one I wrote supports anything that iconv supports on your system). Matt. _______________________________________________________________ Don't miss the 2002 Sprint PCS Application Developer's Conference August 25-28 in Las Vegas -- http://devcon.sprintpcs.com/adp/index.cfm _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk