Re: normalize_charset effects

2018-11-14 Thread RW
On Wed, 14 Nov 2018 19:32:00 +0100 Matus UHLAR - fantomas wrote: > >On Wed, 14 Nov 2018 09:43:25 +0100 > >Matus UHLAR - fantomas wrote: > >> what are direct effects of normalize_charset? > > On 14.11.18 14:37, RW wrote: > >It causes mime text parts that a

Re: normalize_charset effects

2018-11-14 Thread Matus UHLAR - fantomas
On Wed, 14 Nov 2018 09:43:25 +0100 Matus UHLAR - fantomas wrote: what are direct effects of normalize_charset? On 14.11.18 14:37, RW wrote: It causes mime text parts that aren't UTF-8 to be translated into UTF-8. does this apply only for rules or even for things like bayes? I mean, w

Re: normalize_charset effects

2018-11-14 Thread RW
On Wed, 14 Nov 2018 09:43:25 +0100 Matus UHLAR - fantomas wrote: > Hello, > > what are direct effects of normalize_charset? It causes mime text parts that aren't UTF-8 to be translated into UTF-8. > Will enabling normalize_charset cause some immediate benefits or > disad

normalize_charset effects

2018-11-14 Thread Matus UHLAR - fantomas
Hello, what are direct effects of normalize_charset? Does it affect e.g. bayes? I found some slowness report https://bz.apache.org/SpamAssassin/show_bug.cgi?id=5691 https://rt.perl.org//Public/Bug/Display.html?id=66852 ...that should be fixed in 5.20 and discussion around: http

Re: normalize_charset

2016-04-30 Thread RW
s the debuggin result when /normal_charset/ > disabled: Images aren't very useful. We need to find a reproducible problem. It's very easy to find example of CJK rules that do work with "normalize_charset 1" on UTF-8 text, the following example came from text from a random spa

Re: normalize_charset

2016-04-29 Thread RW
> Am 29.04.2016 um 11:54 schrieb yblin: > > I can't filter the mail by body rules.However ,I can filter utf-8 > > encoding mails with /normalize_charset/ option disabled Do you have an example of a rule that doesn't work? Preferably a simple one.

Re: normalize_charset

2016-04-29 Thread Reindl Harald
Am 29.04.2016 um 11:54 schrieb yblin: I can't filter the mail by body rules.However ,I can filter utf-8 encoding mails with /normalize_charset/ option disabled stay on list! as said you need to utf8-enocde your body rules when it's done properly they work, proven by 1200 body-rule

Re: normalize_charset

2016-04-29 Thread Reindl Harald
s but also non-UTF-8 encoding emails if I enable the option "normalize_charset".However,when I use this option, a strange problem occures---I can not filter UTF-8 encoding or non-UTF-8 encoding emails with this option enabled but can filter UTF-8 encoding emails with this option disabled. Co

normalize_charset

2016-04-29 Thread yblin
if I enable the option "normalize_charset".However,when I use this option, a strange problem occures---I can not filter UTF-8 encoding or non-UTF-8 encoding emails with this option enabled but can filter UTF-8 encoding emails with this option disabled. Could you tell me what the proble

Re: Current best-practices around normalize_charset?

2014-03-14 Thread Ivo Truxa
From: "Jay Sekora [via SpamAssassin]" I forgot to comment on this: > Seems like just normalizing them to U+ might be better than > trying to transcribe them. (And that would let a brave or foolhardy > mail administrator write rules to match patterns seen in, say, > Chinese-language spam

Re: Current best-practices around normalize_charset?

2014-03-14 Thread Ivo Truxa
You need to install the Text::Unidecode module for the script to work (for the patch, of course, too). > Anyway, glad to hear that normalize_charset hasn't been causing you > problems, and for us, normalizing to UTF8 is almost certainly what we > want if it's reasonably saf

Re: Current best-practices around normalize_charset?

2014-03-14 Thread Jay A. Sekora
On Wed, 2014-03-12 at 19:04 -0700, Ivo Truxa wrote: > Your message is a few months old, but I see no answer, and stumbled upon it > when writing an enhanced version of the normalize_charset feature, so > thought that I could perhaps help. Thanks! I'm glad to hear of your expe

Re: Current best-practices around normalize_charset?

2014-03-12 Thread Ivo Truxa
Hello, Your message is a few months old, but I see no answer, and stumbled upon it when writing an enhanced version of the normalize_charset feature, so thought that I could perhaps help. Jay Sekora wrote > Hi. We're running SpamAssassin 3.3.1, and pursuant to some advice I'

Current best-practices around normalize_charset?

2013-07-16 Thread Jay Sekora
Hi. We're running SpamAssassin 3.3.1, and pursuant to some advice I've seen in archives of this list and spamassassin-dev (e.g., http://osdir.com/ml/spamassassin-dev/2009-07/msg00156.html), I am *not* using normalize_charset. Unfortunately, this makes filtering text in binary

Rules with non-ASCII characters and normalize_charset 1

2008-10-31 Thread Ben Winslow
guage (which is a maintenance nightmare and probably prone to false-positives), it looks like 'normalize_charset 1' should allow me to write the rules once in UTF-8, but this isn't the way it's working. I'm not entirely sure why it doesn't Just Work (without support for ca