>>>>> "Jean-Pierre" == Jean-Pierre Chrétien <[EMAIL PROTECTED]> writes:

Jean-Pierre> Could the diacritics problem come from the list of
Jean-Pierre> locales not including utf8 ?

What is the diacritics problem? That we cannot check accented
characters? We should find a way to read the Unicode tables and decide
the class of each character (to replace the code in
support/textutils.h). I think this is a prerequisite for 1.5.
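
To make the idea concrete, here is a rough sketch of what reading such a
table could look like. It assumes input in the UnicodeData.txt layout
(semicolon-separated fields, code point in hex first, general category
third). The names loadCategories, isLetter and isCombiningMark are
hypothetical and not existing LyX code, and range entries (the
"First>"/"Last>" pairs) are not handled.

```cpp
#include <cstdint>
#include <istream>
#include <map>
#include <sstream>
#include <string>

// Hypothetical helper, not existing LyX code.
// Reads lines in the UnicodeData.txt layout and maps each code point
// to its general category (field 2, e.g. "Lu", "Ll", "Mn", "Nd").
std::map<std::uint32_t, std::string> loadCategories(std::istream & in)
{
	std::map<std::uint32_t, std::string> table;
	std::string line;
	while (std::getline(in, line)) {
		// Locate the first three field separators.
		std::string::size_type const p1 = line.find(';');
		if (p1 == std::string::npos)
			continue;
		std::string::size_type const p2 = line.find(';', p1 + 1);
		if (p2 == std::string::npos)
			continue;
		std::string::size_type const p3 = line.find(';', p2 + 1);
		if (p3 == std::string::npos)
			continue;
		// Field 0 is the code point in hexadecimal; skip malformed lines.
		std::istringstream field(line.substr(0, p1));
		std::uint32_t cp = 0;
		if (!(field >> std::hex >> cp))
			continue;
		table[cp] = line.substr(p2 + 1, p3 - p2 - 1);
	}
	return table;
}

// True if the general category of c starts with the given major class
// letter ('L' = letter, 'M' = mark, 'N' = number, ...).
bool hasMajorClass(std::map<std::uint32_t, std::string> const & table,
                   std::uint32_t c, char major)
{
	std::map<std::uint32_t, std::string>::const_iterator it = table.find(c);
	return it != table.end() && !it->second.empty()
		&& it->second[0] == major;
}

bool isLetter(std::map<std::uint32_t, std::string> const & table,
              std::uint32_t c)
{
	return hasMajorClass(table, c, 'L');
}

bool isCombiningMark(std::map<std::uint32_t, std::string> const & table,
                     std::uint32_t c)
{
	return hasMajorClass(table, c, 'M');
}
```

With a table loaded this way, U+00E9 (e with acute) would be classified
as a letter and U+0301 (combining acute accent) as a mark, independent
of the current locale.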

Is this
 http://crl.nmsu.edu/~mleisher/ucdata.html
something we could use? Or do we have a way to coerce the C library to
give us the information we need without switching locales?
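
One possibility on the C++ side, as a sketch only: a std::locale object
can be constructed by name and queried through its ctype facet without
ever calling setlocale() or std::locale::global(). The helper names
below are hypothetical, the locale name is an assumption about what is
installed, and how well wide characters outside ASCII are classified
depends on the platform's locale data.

```cpp
#include <locale>
#include <stdexcept>

// Hypothetical helper: build a locale object by name without touching
// the global locale. Falls back to the classic "C" locale when the
// named locale is not installed on this system.
std::locale makeClassifyLocale(char const * name)
{
	try {
		return std::locale(name);
	} catch (std::runtime_error const &) {
		return std::locale::classic();
	}
}

// Ask the ctype facet of the given locale whether c is alphabetic.
// The global locale is neither read nor modified here.
bool isAlphaIn(std::locale const & loc, wchar_t c)
{
	return std::use_facet<std::ctype<wchar_t> >(loc)
		.is(std::ctype_base::alpha, c);
}
```

A caller would do something like
isAlphaIn(makeClassifyLocale("en_US.UTF-8"), c), keeping the locale
object around rather than rebuilding it for every character.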

I see here that Boost.Regex is able to use character properties:
http://www.boost.org/libs/regex/doc/character_class_names.html

Does that mean that we should also be able to access them with our
current code?

JMarc
