Re: Problems with Cyrillic spam

2011-12-15 Thread darxus
On 12/15, Martin Gregorie wrote:
> In that case I'm missing some information: how to write a rule that can
> interpret the value(s) returned by TextCat.
I think you're looking for: ok_languages en fr de - http://spamassassin.apache.org/full/3.3.x/doc/Mail_SpamAssassin_Plugin_TextCat.html
> Why

Re: Problems with Cyrillic spam

2011-12-15 Thread Martin Gregorie
On Thu, 2011-12-15 at 10:57 -0500, dar...@chaosreigns.com wrote:
> On 12/15, Martin Gregorie wrote:
> > The problem that needs addressing is that the ok_locales configuration
> > parameter doesn't work. This appears to be because it thinks the
> > sender's choice of (in Windows terms) the character

Re: Problems with Cyrillic spam

2011-12-15 Thread darxus
On 12/15, Martin Gregorie wrote:
> The problem that needs addressing is that the ok_locales configuration
> parameter doesn't work. This appears to be because it thinks the
> sender's choice of (in Windows terms) the character translation code
> page is a reliable indication of the sender's locale.
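
The point in the quoted text, that a declared character set says little about the language actually written, can be illustrated with a minimal Python sketch. It is not SpamAssassin code; the sample string and variable names are made up for illustration. The same Cyrillic word is encoded under three charsets that a sender's mail client might pick:

```python
# Illustration only: one Cyrillic word, three different declared charsets.
# "sample" and "charsets" are hypothetical names chosen for this sketch.
sample = "Привет"
charsets = ["koi8_r", "cp1251", "utf-8"]

# Encode the same text under each charset; the byte strings all differ.
encoded = {name: sample.encode(name) for name in charsets}

# Decoding each byte string with its own charset recovers identical text,
# so the charset label alone cannot distinguish the language being used.
decoded = {name: data.decode(name) for name, data in encoded.items()}

for name in charsets:
    print(f"{name:8} {encoded[name].hex()}")
```

A check keyed only on the charset label would treat the first two as Cyrillic-specific charsets and learn nothing from the third, even though the text is the same in every case.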

Re: Problems with Cyrillic spam

2011-12-15 Thread Martin Gregorie
On Wed, 2011-12-14 at 23:36 -0500, dar...@chaosreigns.com wrote:
> On 12/15, Martin Gregorie wrote:
> > Could somebody with access to the SA Bugzilla kindly add a comment to
> > bug 4078 saying that this is also an issue with Cyrillic encoded in
> > UTF-8? I'm asking because at present #4078 only m

Re: Problems with Cyrillic spam

2011-12-14 Thread darxus
On 12/15, Martin Gregorie wrote:
> Could somebody with access to the SA Bugzilla kindly add a comment to
> bug 4078 saying that this is also an issue with Cyrillic encoded in
> UTF-8? I'm asking because at present #4078 only mentions Windows code
> pages and koi8. There is nothing to indicate that

Re: Problems with Cyrillic spam

2011-12-14 Thread Martin Gregorie
On Wed, 2011-12-14 at 19:38 -0500, dar...@chaosreigns.com wrote:
> On 12/15, Martin Gregorie wrote:
> > I'm getting spam with the Subject, Sender personal name and body all
> > written in Cyrillic, but, despite having "ok_locales en fr de" defined
> > in local.cf, no rules are fired to mark the mes

Re: Problems with Cyrillic spam

2011-12-14 Thread Karsten Bräckelmann
On Thu, 2011-12-15 at 00:09 +0000, Martin Gregorie wrote:
> I'm running SA 3.3.2 and would appreciate knowing how it recognises that
> a message contains a language that is not listed as belonging to an OK
> locale.
It's based on the charset. For obvious reasons, UTF-8 is excluded here. What woul
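
Since a UTF-8 label carries no hint of the language, one alternative is to look at the decoded text itself. The following is a minimal Python sketch of that idea, not a description of how SpamAssassin or TextCat works; the function names and the 0.5 threshold are assumptions made for illustration.

```python
import unicodedata


def cyrillic_ratio(text: str) -> float:
    """Return the fraction of alphabetic characters that are Cyrillic.

    A character counts as Cyrillic when its Unicode name starts with
    "CYRILLIC". Non-letters (digits, spaces, punctuation) are ignored.
    """
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    cyrillic = sum(
        1 for ch in letters
        if unicodedata.name(ch, "").startswith("CYRILLIC")
    )
    return cyrillic / len(letters)


def looks_cyrillic(raw: bytes, threshold: float = 0.5) -> bool:
    """Decode UTF-8 bytes and report whether most letters are Cyrillic.

    The threshold is an arbitrary illustrative value, not a tuned one.
    """
    text = raw.decode("utf-8", errors="replace")
    return cyrillic_ratio(text) >= threshold
```

For example, `looks_cyrillic("Привет, мир".encode("utf-8"))` is true while `looks_cyrillic(b"Hello world")` is false, even though both inputs would be labelled with the same UTF-8 charset.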

Re: Problems with Cyrillic spam

2011-12-14 Thread darxus
On 12/15, Martin Gregorie wrote:
> I'm getting spam with the Subject, Sender personal name and body all
> written in Cyrillic, but, despite having "ok_locales en fr de" defined
> in local.cf, no rules are fired to mark the message as being in an
> unwanted language.
Probably related to this: http