Re: From wchar_t to char32_t

2023-07-13 Thread Paul Eggert
On 2023-07-13 08:14, Bruno Haible wrote: By reading the source code of FreeBSD, NetBSD, OpenBSD, macOS, Solaris, and so on, I can easily determine - which parts of the mbstate_t mbsinit() tests, - which parts of the mbstate_t the various functions use. But in order to understand what inter

mbchar: Optimize is_basic

2023-07-13 Thread Bruno Haible
So far, the is_basic_table in mbchar.c marked all bytes in the range 0x20..0x7E as "basic", but not all of the control characters 0x00..0x1F. But nowadays, all locale encodings map this range 0x00..0x1F to U+..U+001F. The last encodings which did not have this property were VISCII and TCVN5712-

localcharset: Clean up locale encodings used by glibc

2023-07-13 Thread Bruno Haible
This patch removes from lib/localcharset.h some of those encodings that are not supported by glibc for more than 10 years. 2023-07-13 Bruno Haible localcharset: Clean up locale encodings used by glibc. * lib/localcharset.h: Remove VISCII from the list, since never supported

Re: not bug, but issue

2023-07-13 Thread Bruno Haible
Hi, Alexei Sholomitskiy wrote: > i have run ./gnulib-tool --create-megatestdir --with-tests --dir=mybuild > > unforunately, process is very long Yes, --create-megatestdir is only reasonable for a small set (say, at most 10) modules. Running it with all > 1800 modules of gnulib will likely take m

not bug, but issue

2023-07-13 Thread Alexei Sholomitskiy
i have run ./gnulib-tool --create-megatestdir --with-tests --dir=mybuild unforunately, process is very long and I had interrupted it why there is no any checks on submodules directories existents in order to continue interruupted process? Except this I see next error executing aclocal -I gl

Re: mbcel module for Gnulib?

2023-07-13 Thread Bruno Haible
Hi Paul, > > Candidates for optimization: > > > > - The C locale handling > >https://sourceware.org/bugzilla/show_bug.cgi?id=19932 > >https://sourceware.org/bugzilla/show_bug.cgi?id=29511 > >It's now a clear POSIX violation. Would it make sense to get this fixed > >in glibc, so th

Re: From wchar_t to char32_t

2023-07-13 Thread Bruno Haible
Paul Eggert wrote: > > Based on the comments in gnulib/lib/mbrtoc16.c, I think it should better > > clear the first 24, not 12, bytes of the struct. Otherwise it can be in > > a state where mbsinit() returns true but the mbrto* functions have > > undefined behaviour. > > For mbcel all all that mat

Re: From wchar_t to char32_t

2023-07-13 Thread Bruno Haible
I wrote: > 7.32.3.2 > towctrans -- rarely used > wctrans -- rarely used It's not hard to implement replacements for these two functions either. So that we get this correspondence: wchar_t char32_t --- t