dman wrote:
> On Tue, May 28, 2002 at 09:53:14AM +0100, Matt Sergeant wrote:
> | dman wrote:
> | >What is the charset GB2312?  Is it a windows thing?  Trying to display
> | >this spam message just gives me an odd sequence of high-order latin1
> | >chars.
> | 
> | It's an encoding for Chinese characters.
> 
> Ok, it's now being rejected as well.
> 
> | >Also, what is =?UNKNOWN?Q? in a header (eg Subject:)?  I know it's
> | >supposed to give the charset used in the header, but UNKNOWN?  You
> | >don't know what charset you used!?  Come on.
> | 
> | There are lots of broken mailers out there, sadly.
> 
> I've seen that a few times, but the only message I currently have like
> that is the spam message.  It doesn't indicate which mailer sent it.
> It does have some interesting headers, though :
> 
> From: ************@[202.101.10.130]
> Sender: EmailSender@[202.101.10.130]
> Reply-to: [EMAIL PROTECTED]
> 
> | >Both of these (found in the same message just now) seem like good
> | >reasons to junk the message.  
> | >
> | >I just discovered an error in the SUBJ_ALL_CAPS rule, I think.  If
> | >the subject is encoded in a different charset than US-ASCII it could
> | >have "all caps" but really be other characters.  When that rule is
> | >evaluated, is it evaluated on the raw header or the decoded version?
> | 
> | Unfortunately it's done on the raw headers at the moment. Hopefully with
> | the port to MIME::tools (or my custom mail parser) that will change, but
> | I don't think MIME::tools supports that many character sets - just the
> | ISO-8859 ones (the one I wrote supports anything that iconv supports on
> | your system).
> 
> I think we need both raw and decoded header tests.  Checking for
> things like the UNKNOWN charset could be useful.  Do you know of a
> legitimate mailer that inserts that code?

I think pine or some other text based mailers might when they encounter 
stuff they don't recognise (like 8 bit chars when the terminal is set 
for ASCII).
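
For illustration only, a check for unrecognised charset labels such as
UNKNOWN could look roughly like the Python sketch below. It is not
SpamAssassin code; the function name unknown_charsets is hypothetical, and
"unknown" here simply means "Python has no codec registered under that
name", which is an assumption standing in for whatever charset list a real
filter would use.

```python
import codecs
import re

# RFC 2047 encoded-word: =?charset?encoding?encoded-text?=
ENCODED_WORD = re.compile(r"=\?([^?\s]+)\?([bBqQ])\?([^?\s]*)\?=")


def unknown_charsets(raw_header):
    """Return charset labels in raw_header that Python has no codec for.

    Hypothetical helper: it only inspects the raw header text and never
    decodes the payload.
    """
    bad = []
    for match in ENCODED_WORD.finditer(raw_header):
        # RFC 2231 allows a "*language" suffix on the charset; drop it.
        charset = match.group(1).split("*")[0]
        try:
            codecs.lookup(charset)
        except LookupError:
            bad.append(charset)
    return bad
```

Called on a raw Subject line, it returns an empty list when every charset
label is recognised, so a rule could fire whenever the list is non-empty.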

I agree you need both. That's easy enough to do with my mail class (it's 
a one line change). Does anyone know if encoded headers in MIME 
sub-parts are legitimate? I seem to recall reading they're only valid in 
the main headers section, but I can't seem to find that now, and I 
suspect that mail clients will probably decode encoded headers in 
MIME sub-parts anyway.
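
To make the raw versus decoded distinction concrete, here is a rough Python
sketch using the standard email.header module. It is not the mail class
mentioned above and not how SpamAssassin implements SUBJ_ALL_CAPS; the names
decode_subject and all_caps_tests are hypothetical, and falling back to
ASCII for an unrecognised charset is an assumption made for the example.

```python
from email.header import decode_header


def decode_subject(raw):
    """Decode RFC 2047 encoded-words; undecodable bytes become U+FFFD."""
    parts = []
    for data, charset in decode_header(raw):
        if isinstance(data, bytes):
            try:
                data = data.decode(charset or "ascii", errors="replace")
            except LookupError:
                # Unrecognised label (e.g. UNKNOWN): assume ASCII.
                data = data.decode("ascii", errors="replace")
        parts.append(data)
    return "".join(parts)


def all_caps_tests(raw_subject):
    """Return (raw_is_all_caps, decoded_is_all_caps) for one Subject value.

    str.isupper() is true only if the text has at least one cased letter
    and no lowercase ones, so caseless scripts such as Chinese never match.
    """
    return raw_subject.isupper(), decode_subject(raw_subject).isupper()
```

A Q-encoded GB2312 subject such as =?GB2312?Q?=D6=D0=CE=C4?= looks all caps
in its raw form but decodes to two Chinese characters, while
=?iso-8859-1?q?BUY_NOW?= is the opposite case: the raw form has lowercase
letters and the decoded text does not. Running both tests catches each.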

Still, the code is there if someone wants to play with it.

Matt.


_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk
