On Fri, 21 Nov 2014 17:28:19 -0800 Evan Gates <evan.ga...@gmail.com> wrote:
> Notice there's no mention of ASCII, so bytes 0x80 to 0xFF are valid.
> For sbase we want UTF-8 support. Should we assume/enforce only valid
> UTF-8? Doing so makes a lot of coding easier and less sucky, but means
> that some POSIX text files will not be sbase text files when we run
> into the aforementioned bytes. In this case what's more important?
> Strict POSIX compliance? Or code that sucks less?

UTF-8 is the way to go. TLH and I have already started porting sbase to
libutf.
For the reasons, I'll just link this[0] document here for your reading
pleasure.

Cheers

FRIGN

[0]: http://utf8everywhere.org/

--
FRIGN <d...@frign.de>