Mike Gran <spk...@yahoo.com> writes:

>> Thanks for the Unicode 6.1 update!  Now, however:
>>
>> FAIL: srfi-14.test: Latin-1 (8-bit charset): char-set:symbol
>>
>> Would you be willing to investigate?
>
> Looks like Unicode 6.1 has recategorized some of the symbols, including
> a few in Latin-1.
>
> "§" U+00A7 SECTION SIGN from Symbol_Other to Punctuation_Other
> "¶" U+00B6 PILCROW SIGN from Symbol_Other to Punctuation_Other
>
> It seems that the correct response would be just to change
> the Latin-1 test cases.
>
> A wrinkle, though, is that in SRFI-14, they call out "§" and "¶"
> as symbols.  But my interpretation of the text in SRFI-14 is that
> they intended to follow Unicode's categorization.

Agreed.

> http://srfi.schemers.org/srfi-14/srfi-14.html

SRFI-14 states:

  char-set:symbol
    In Unicode, a symbol is any character that has one of the symbol
    categories in the Unicode character database (Sm, Sc, Sk, or So).

and I think that this is intended to be the normative definition.
SRFI-14 then proceeds to list the symbols of ASCII and Latin-1, but I
interpret that as non-normative, to save the reader the trouble of
consulting Unicode.  IMHO, anyway.

    Thanks!
      Mark
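
For anyone who wants to check the recategorization outside Guile, here is
a minimal sketch in Python.  It is not part of the Guile test suite.  It
assumes a Python whose unicodedata module is built on Unicode 6.1 or
later, and the helper names (is_symbol, latin1_symbols) are made up for
illustration.  It applies the category-based definition quoted above to
the Latin-1 range:

```python
import unicodedata

# Symbol categories named in SRFI-14's definition of char-set:symbol.
SYMBOL_CATEGORIES = {"Sm", "Sc", "Sk", "So"}


def is_symbol(ch):
    """Return True if ch falls in one of the Unicode symbol categories."""
    return unicodedata.category(ch) in SYMBOL_CATEGORIES


def latin1_symbols():
    """List the characters in U+0000..U+00FF that count as symbols."""
    return [chr(cp) for cp in range(0x100) if is_symbol(chr(cp))]


if __name__ == "__main__":
    # The result depends on the Unicode version Python was built with.
    print("Unicode database version:", unicodedata.unidata_version)
    for ch in "\u00a7\u00b6":  # SECTION SIGN, PILCROW SIGN
        print("U+%04X %s %s" % (ord(ch), unicodedata.name(ch),
                                unicodedata.category(ch)))
    print("Latin-1 symbols:", "".join(latin1_symbols()))
```

With a Unicode 6.1 or later database, both characters should report the
category Po and drop out of the computed Latin-1 symbol set, which is
consistent with changing the test cases rather than the charset.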