Re: Operations on headers in UTF-8

2014-06-11 Thread Daniel Staal
--As of June 11, 2014 4:25:31 AM +0200, Karsten Bräckelmann is alleged to have said:
On Tue, 2014-06-10 at 21:22 -0400, Daniel Staal wrote:
--As of June 11, 2014 2:45:25 AM +0200, Karsten Bräckelmann is alleged to have said:
> Worse, enabling charset normalization completely breaks UTF-8

Re: Operations on headers in UTF-8

2014-06-10 Thread Karsten Bräckelmann
On Tue, 2014-06-10 at 21:22 -0400, Daniel Staal wrote:
> --As of June 11, 2014 2:45:25 AM +0200, Karsten Bräckelmann is alleged to
> have said:
> > Worse, enabling charset normalization completely breaks UTF-8 chars
> > in the regex. At least in my ad-hoc --cf command line testing. > > --

Re: Operations on headers in UTF-8

2014-06-10 Thread Daniel Staal
--As of June 11, 2014 2:45:25 AM +0200, Karsten Bräckelmann is alleged to have said:
Worse, enabling charset normalization completely breaks UTF-8 chars in the regex. At least in my ad-hoc --cf command line testing.
--As for the rest, it is mine.
This sounds like something where `use

Re: Operations on headers in UTF-8

2014-06-10 Thread Karsten Bräckelmann
On Tue, 2014-06-10 at 17:39 -0400, Alex wrote:
> On Tue, Jun 10, 2014 at 3:25 PM, Karsten Bräckelmann wrote:
> It's here where I'm starting to lose you:

Reading through your reply, I see we need to get to the basics first. You are massively confusing different types of encoding and not fully real

Re: Operations on headers in UTF-8

2014-06-10 Thread Alex
Hi,

On Tue, Jun 10, 2014 at 3:25 PM, Karsten Bräckelmann wrote:
>
> On Tue, 2014-06-10 at 13:53 -0400, Alex wrote:
> > I'm not very familiar with how to manage language encoding, and hoped
> > someone could help. Some time ago I wrote a rule that looks for
> > subjects that consist of a single wo

Re: Operations on headers in UTF-8

2014-06-10 Thread Karsten Bräckelmann
On Tue, 2014-06-10 at 13:53 -0400, Alex wrote:
> I'm not very familiar with how to manage language encoding, and hoped
> someone could help. Some time ago I wrote a rule that looks for
> subjects that consist of a single word that's more than N characters.
> It works, but I'm learning that it's per

Operations on headers in UTF-8

2014-06-10 Thread Alex
Hi all,

I'm not very familiar with how to manage language encoding, and hoped someone could help. Some time ago I wrote a rule that looks for subjects that consist of a single word that's more than N characters. It works, but I'm learning that it's performed before the content of the subject is con
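
For readers following the thread, here is a minimal sketch of the general issue behind a "single word longer than N characters" rule. It is written in Python for illustration only and is not the poster's rule or anything from SpamAssassin. It assumes the concern is that a length-based pattern applied to undecoded UTF-8 bytes counts bytes, not characters. The function names, the sample subject, and the threshold of 10 are all hypothetical.

```python
import re


def long_single_word_bytes(raw_subject: bytes, n: int) -> bool:
    """True if the undecoded subject is one whitespace-free run of more than n BYTES."""
    pattern = rb"\S{%d,}" % (n + 1)
    return re.fullmatch(pattern, raw_subject) is not None


def long_single_word_chars(subject: str, n: int) -> bool:
    """True if the decoded subject is one whitespace-free run of more than n CHARACTERS."""
    pattern = r"\S{%d,}" % (n + 1)
    return re.fullmatch(pattern, subject) is not None


if __name__ == "__main__":
    subject = "Привет"             # hypothetical subject: 6 Cyrillic letters
    raw = subject.encode("utf-8")  # each letter takes 2 bytes in UTF-8

    print(len(subject), len(raw))               # 6 12
    print(long_single_word_chars(subject, 10))  # False: only 6 characters
    print(long_single_word_bytes(raw, 10))      # True: 12 bytes exceed the threshold
```

The same six-letter word passes or fails the length test depending on whether the pattern sees decoded characters or raw bytes, which may be why such a rule behaves differently on non-ASCII subjects.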