On 05/26/2010 05:24 AM, Karsten Bräckelmann wrote: > > Unfortunately, in this case, the fact that it isn't a proper, raw > message is not irrelevant. The ok_locales setting, which is part of your > original question, depends on the char-set used. Which is missing from > the sample. We only can assume it was an UTF-8 encoded HTML document. > Even that is a legitimate corner case. What does SA do with an UTF8 email where that charset isn't explicitly mentioned, but the Content-Transfer-Encoding: is set to "8bit"? I think that is non-RFC compliant, but I also know that Thunderbird resolves it just fine (not that it should of) - so it's a "legitimate" way for a spammer to send spam.
Here's a link to the Greek one I got recently. UTF8, Greek and yet FARAWAY didn't trigger (I have "ok_locales en"). I even have TextCat enabled (didn't work for this email) - but I don't think it's used by the charset stuff anyway? http://pastebin.com/XyHU2krq -- Cheers Jason Haar Information Security Manager, Trimble Navigation Ltd. Phone: +64 3 9635 377 Fax: +64 3 9635 417 PGP Fingerprint: 7A2E 0407 C9A6 CAF6 2B9F 8422 C063 5EBB FE1D 66D1