On 11/23/15 19:09, Paolo Bonzini wrote: > > > On 23/11/2015 18:59, Laszlo Ersek wrote: >>>> The only interesting thing is that parse_keyword always eats >>>> a TOKEN_KEYWORD, even if it is invalid, so it must come last in >>>> parse_value (otherwise, NULL is returned, parse_literal is invoked >>>> and it tries to peek beyond end of input). This is caught by >>>> /errors/unterminated/literal, which actually checks for an unterminated >>>> keyword. ಠ_ಠ >> Is it accepted practice to put UTF-8 in commit messages? (Or, actually, >> anywhere in patches, except maybe the notes section?) >> >> I'd recommend o_O. > > Perhaps not Kannada, but Unicode 0xA1 to 0x1FF is pretty common in > commit messages.
That seems to be a subset of the union of the following three: https://en.wikipedia.org/wiki/Latin-1_Supplement_%28Unicode_block%29 https://en.wikipedia.org/wiki/Latin_Extended-A https://en.wikipedia.org/wiki/Latin_Extended-B Thanks for the info, Laszlo