On 07/24/19 03:45, John Naylor wrote: > On Sun, Jul 21, 2019 at 3:14 AM Tom Lane <t...@sss.pgh.pa.us> wrote: >> However, my second reaction was that maybe you were on to something >> upthread when you speculated about postponing de-escaping of >> Unicode literals into the grammar. If we did it like that then
Wow, yay. I hadn't been following this thread, but I had just recently looked over my own earlier musings [1] and started thinking "no, it would be outlandish to ask the lexer to return utf-8 always ... but what about postponing the de-escaping of Unicode literals into the grammar?" and had started to think about when I might have a chance to try making a patch. With the de-escaping postponed, I think we'd be able to move beyond the current odd situation where Unicode escapes can't describe non-ascii characters, in exactly and only the cases where you need them to. -Chap [1] https://www.postgresql.org/message-id/6688474e-7c28-b352-bcec-ea0ef59d7a1a%40anastigmatix.net