Take a look at the -C argument for perl and the PERL_UNICODE environment
variable in http://perldoc.perl.org/perlrun.html

Examine the difference between

perl -E 'say "\x{df}"'

and

PERL_UNICODE=O perl -E 'say "\x{df}"'

That said, if you are working with the web, why in the world are you
sending UTF-8?  HTML has entities for a reason.  I would suggest using
HTML::Entities instead of trying to send non-ASCII characters through who
knows how many layers of things that can screw up UTF-8:

perl -MHTML::Entities -E 'say encode_entities "\x{df}"'



On Tue, Aug 9, 2016 at 7:34 AM hw <h...@gc-24.de> wrote:

> Chas. Owens schrieb:
> >
> > On Thu, Jul 28, 2016 at 10:55 AM Paul Johnson <p...@pjcj.net <mailto:
> p...@pjcj.net>> wrote:
> >
> >     On Thu, Jul 28, 2016 at 10:23:19AM -0400, Chas. Owens wrote:
> >
> > snip
> >
> >      > Also, this answer on StackOverflow by tchrist (Tom Christiansen,
> who I
> >      > would say knows the most about the intersection of Perl and
> Unicode)
> >      > is a good resource: http://stackoverflow.com/a/6163129/78259
> >
> >     Quite.  And utf8::all tries to encapsulate as much of that
> boilerplate
> >     as it can.
> >
> >
> > I have always read that answer as a bit of an indictment of the idea of
> "you should be able to load this module and everything will be fine".
> Unicode is complex and trying to treat it like just another list of
> characters is doomed to teeth gnashing and crying.  Of course, even
> treating it the way it should be leads to teeth gnashing and crying, but at
> least that will be over the fact the humans suck (we can't even agree on
> where þ should be sorted) as opposed to Perl sucking.
>
> When I have something like
>
>
> print $cgi->p('Gebäudefläche:');
>
>
> in my source, which is correctly displayed everywhere else, I also
> need it correctly displayed in the web browser --- even particularly
> there because that is what the users are looking at.
>
> And that´s all there is to it.  It´s really that simple.
>
>

Reply via email to