Re: [mp2] OutputFilter with UTF-8 characters

2003-11-12 Thread Stas Bekman
Ged Haywood wrote: Hi Stas, On Tue, 11 Nov 2003, Stas Bekman wrote: [snip] Who would think that a response handler: print while ; would affect the special vars in the filter called by print. Who indeed! [snip] So let's decide how do we act upon this: [snip] I'm not sure whether we should

Re: [mp2] OutputFilter with UTF-8 characters

2003-11-12 Thread Ged Haywood
Hi Stas, On Tue, 11 Nov 2003, Stas Bekman wrote: > [snip] > Who would think that a response handler: > >print while ; > > would affect the special vars in the filter called by print. Who indeed! > [snip] > So let's decide how do we act upon this: > [snip] > I'm not sure whether we should

Re: [mp2] OutputFilter with UTF-8 characters

2003-11-11 Thread Stas Bekman
Matthew Darwin wrote: After extensive playing around with this (inside mod_perl and out), I have come up with two observations: 1) doing regexes on UTF-8 characters split across buckets in an output filter seems to be not a problem. All my regexes are against ASCII characters. Good. But could

Re: [mp2] OutputFilter with UTF-8 characters

2003-11-11 Thread Matthew Darwin
After extensive playing around with this (inside mod_perl and out), I have come up with two observations: 1) doing regexes on UTF-8 characters split across buckets in an output filter seems to be not a problem. All my regexes are against ASCII characters. 2) mod_perl seems to get confused whe

Re: [mp2] OutputFilter with UTF-8 characters

2003-11-05 Thread Stas Bekman
Matthew Darwin wrote: Stas Bekman wrote: I'd suggest to take whatever data you s/// and try it outside mod_perl first. May be your filter or some previous filter has truncated the UTF-8 char in the middle? You should be aware that other filters are not aware of the encoding, and they just giv

Re: [mp2] OutputFilter with UTF-8 characters

2003-11-05 Thread Matthew Darwin
Stas Bekman wrote: I'd suggest to take whatever data you s/// and try it outside mod_perl first. May be your filter or some previous filter has truncated the UTF-8 char in the middle? You should be aware that other filters are not aware of the encoding, and they just give you the amount of dat

Re: [mp2] OutputFilter with UTF-8 characters

2003-11-05 Thread Stas Bekman
Matthew Darwin wrote: I'm just wondering if anyone has any input on this issue. I'm implementing an output filter, like so: SetHandler perl-script PerlResponseHandler ModPerl::Registry PerlOutputFilterHandler Apache::Kinnetics::Output and I get the following error on some web pages that have U

[mp2] OutputFilter with UTF-8 characters

2003-11-05 Thread Matthew Darwin
I'm just wondering if anyone has any input on this issue. I'm implementing an output filter, like so: SetHandler perl-script PerlResponseHandler ModPerl::Registry PerlOutputFilterHandler Apache::Kinnetics::Output and I get the following error on some web pages that have UTF-8 data: [Wed Nov 0