On Jun 1, 1:07 am, [EMAIL PROTECTED] (Oryann9) wrote:
> > Oddly, there's a uri_escape_utf8 but no
> > uri_unescape_utf8 provided
> > by URI::Escape.
>
> > However combining URI::Escape::uri_unescape() and
> > Encode::decode_utf8()
> > in one statement is not overly taxing.
>
> > use Encode;
> > use URI::Escape qw(uri_unescape);
> > my $e_acute = decode_utf8 uri_unescape '%C3%A9';
>
> Is %C3 equal to Ã Capital A, tilde
> and
> %A9 equal to © Copyright.

No, they are not _equal_.

In URI escaping, %C3 simply encodes the _byte_ with hexadecimal value
C3, and %A9 the byte with value A9.
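
To see that for yourself, here is a minimal sketch (assuming URI::Escape
is installed; the variable name $bytes is just for illustration) that
unescapes the string and dumps the resulting bytes without decoding
them as any particular character encoding:

```perl
use strict;
use warnings;
use URI::Escape qw(uri_unescape);

# uri_unescape() only reverses the %XX escaping; the result is a byte string.
my $bytes = uri_unescape('%C3%A9');

# Show how many bytes we got and their hexadecimal values.
printf "length: %d\n", length $bytes;
printf "bytes:  %s\n", join ' ', map { sprintf '%02X', ord } split //, $bytes;
```

At this point no decision about characters has been made; there are
just two bytes.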

What sequence of _characters_ (aka "Unicode code points") that two-byte
sequence encodes will depend upon the encoding in use. In Latin1
it encodes the two characters 'Ã©'. In UTF8 it encodes the single
character 'é'. (In other encodings it'll encode other things.)
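
As a rough illustration of that point, the sketch below decodes the same
two bytes under both encodings using Encode::decode() and prints the
code points that result. The variable names are only for illustration.

```perl
use strict;
use warnings;
use Encode qw(decode);

my $bytes = "\xC3\xA9";    # the two bytes that '%C3%A9' unescapes to

# Same bytes, two different interpretations.
my $as_latin1 = decode('ISO-8859-1', $bytes);   # two characters: U+00C3, U+00A9
my $as_utf8   = decode('UTF-8',      $bytes);   # one character:  U+00E9

# Print the character count and code points for each interpretation.
for my $pair ([latin1 => $as_latin1], [utf8 => $as_utf8]) {
    my ($name, $chars) = @$pair;
    printf "%-6s %d char(s): %s\n", $name, length $chars,
        join ' ', map { sprintf 'U+%04X', ord } split //, $chars;
}
```

So the Latin1 reading gives A-tilde followed by the copyright sign, and
the UTF8 reading gives a single e-acute.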

> So you are trying to convert to HTML Flash found here
> http://www.allwebco-templates.com/support/S_hex.htm?

No, that assumes Latin1 encoding. The OP is talking about utf8.


