Thank you, yes, that is what I was looking for.
I must have read right over it 3-4 times without seeing it.

On 08/04/2012 02:59 PM, Matthew Flatt wrote:
I think you're looking for #px"\\p{L}".

See the "\p"<atom>  production and the<category>  non-terminal in

    http://docs.racket-lang.org/reference/regexp.html#(part._regexp-syntax)

At Sat, 04 Aug 2012 14:45:30 -0700, Charles Hixson wrote:
Are there any unicode regular expression character classes?

I'm hoping for something similar to [:alpha:], etc. that are based
around, say, the first letter of the unicode character classification.
I *can* do what I want by disassembling strings by hand and using tests
based on char-general-category, but a regular expression would (should?)
be much neater.

(I know that these aren't mentioned in the documentation, but it just
says that it's talking about the "Frequently Used Character Classes",
not that there aren't any others.)

--
Charles Hixson



--
Charles Hixson

____________________
 Racket Users list:
 http://lists.racket-lang.org/users

Reply via email to