I think you're looking for #px"\\p{L}". See the "\p" <atom> production and the <category> non-terminal in
http://docs.racket-lang.org/reference/regexp.html#(part._regexp-syntax)

At Sat, 04 Aug 2012 14:45:30 -0700, Charles Hixson wrote:
> Are there any unicode regular expression character classes?
>
> I'm hoping for something similar to [:alpha:], etc. that are based
> around, say, the first letter of the unicode character classification.
> I *can* do what I want by disassembling strings by hand and using tests
> based on char-general-category, but a regular expression would (should?)
> be much neater.
>
> (I know that these aren't mentioned in the documentation, but it just
> says that it's talking about the "Frequently Used Character Classes",
> not that there aren't any others.)
>
> --
> Charles Hixson
____________________
  Racket Users list:
  http://lists.racket-lang.org/users
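
For reference, here is a minimal sketch of how the #px"\\p{L}" suggestion above might be used. The names letter-rx, letters-rx and upper-rx and the sample strings are only for illustration, and I'm assuming the source file is saved as UTF-8:

```racket
#lang racket/base

;; \p{L} matches one character whose Unicode general category
;; starts with L (Lu, Ll, Lt, Lm, Lo).
(define letter-rx  #px"\\p{L}")
;; One or more consecutive letters.
(define letters-rx #px"\\p{L}+")
;; A two-letter category name narrows the match, here to uppercase letters.
(define upper-rx   #px"\\p{Lu}")

;; Does the string contain at least one letter?
(regexp-match? letter-rx "ж")    ; a Cyrillic lowercase letter
(regexp-match? letter-rx "42")   ; digits only, so no letter

;; Pull out every run of letters, skipping digits and spaces.
(regexp-match* letters-rx "мир 123 λx")

;; Pull out only the uppercase letters.
(regexp-match* upper-rx "Привет Мир")
```

The same syntax should work with other categories listed under the <category> non-terminal, such as \p{Nd} for decimal digits, and \P{L} (capital P) inverts the test.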