On Jun 1, 1:07 am, [EMAIL PROTECTED] (Oryann9) wrote: > > Oddly, there's a uri_unescape_utf8 but no > > uri_unescape_utf8 provided > > by URI::Escape. > > > However combining URI::Escape::uri_unescape() and > > Encode::decode_utf8() > > in one statement is not overly taxing. > > > use Encode; > > use URI::Escape qw(uri_unescape); > > my $e_accute = decode_utf8 uri_unescape '%C3%A9'; > > Is %C3 equal to à Capital A, tilde > and > %A9 equal to © Copyright.
No they are not _equal_. In URI escaping %C3 simply encodes the _byte_ with a hexadecimal value C3, %A9 the one with value A9. What sequence of _characters_ (aka "Unicode code points") that two byte sequence encode will depend upon the encoding in use. In Latin1 they encode 'é'. In UTF8 they encode 'é'. (In other encodings they'll encode other things). > So you are trying to convert to HTML Flash found here > http://www.allwebco-templates.com/support/S_hex.htm? No, that assumes Latin1 encoding. The OP is talking about utf8. -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] http://learn.perl.org/