On Tue, Jul 15, 2008 at 11:17:23PM +0200, Leopold Toetsch wrote: > 21:51 < pmichaud> so unicode:"«" and unicode:"\xab" would produce > exactly > the same result. > 21:51 < pmichaud> even down to being the same .pbc output. > 21:51 < allison> pmichaud: exactly > > The former is a valid char in an UTF8/iso-8859-1 encoded source file and only > there, while the latter is a single invalid UTF8 char part. How would you > interpret unicode:"\xab\x65" then?
I'd want \xab and \x65 to represent two codepoints, not encoding bytes for a single codepoint. Pm