Re: encoding vs charset

Patrick R. Michaud Tue, 15 Jul 2008 14:35:29 -0700

On Tue, Jul 15, 2008 at 11:17:23PM +0200, Leopold Toetsch wrote:
> 21:51 < pmichaud> so unicode:"«" and unicode:"\xab" would produce
> exactly the same result.
> 21:51 < pmichaud> even down to being the same .pbc output.
> 21:51 < allison> pmichaud: exactly
> 
> The former is a valid char in a UTF8/iso-8859-1 encoded source file and
> only there, while the latter is a single invalid UTF8 char part. How would
> you interpret unicode:"\xab\x65" then?


I'd want \xab and \x65 to represent two codepoints, not encoding bytes
for a single codepoint.
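
To make the distinction concrete, here is a rough sketch in Python rather
than PIR. It only illustrates the two readings and is not how Parrot
handles them; the helper names codepoints() and decodes_as_utf8() are made
up for this example.

```python
def codepoints(s: str) -> list[int]:
    """Return the codepoint value of each character in s."""
    return [ord(c) for c in s]


def decodes_as_utf8(raw: bytes) -> bool:
    """Return True if raw is a well-formed UTF-8 byte sequence."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Reading 1: each \x escape names one codepoint.
# "\xab\x65" is then two characters, U+00AB and U+0065.
two_chars = "\xab\x65"
print(codepoints(two_chars) == [0xAB, 0x65])

# Under this reading the literal character and the escape are the same string.
print("«" == "\xab")

# Reading 2: each \x escape is a raw byte of the encoded form.
# 0xAB alone is a UTF-8 continuation byte, so the sequence is malformed.
print(decodes_as_utf8(b"\xab\x65"))

# The UTF-8 encoding of U+00AB is the two bytes C2 AB, not the single byte AB.
print("\xab".encode("utf-8") == b"\xc2\xab")
```

Under the first reading the escape never has to be valid in any particular
encoding, since it names a codepoint and the encoding is only chosen when
the string is stored.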

Pm
