[EMAIL PROTECTED] wrote:
> I'm trying to make a unicode friendly regexp to grab sentences
> reasonably reliably for as many unicode languages as possible, focusing
> on european languages first, hence it'd be useful to be able to refer
> to any uppercase unicode character instead of just the typica
On 25/05/2006 5:43 AM, [EMAIL PROTECTED] wrote:
> I'm trying to make a unicode friendly regexp to grab sentences
> reasonably reliably for as many unicode languages as possible, focusing
> on european languages first, hence it'd be useful to be able to refer
> to any uppercase unicode character ins
Sorry...I somehow missed the key *uppercase* bit of that, and
somehow got it in my head that you just wanted unicode letters,
not numbers. Please pardon the brain-blink. I can't find
anything in Python's regexp docs that do what you want. Vim's
regexp engine has a "uppercase characters" and
> I'm trying to make a unicode friendly regexp to grab sentences
> reasonably reliably for as many unicode languages as
> possible, focusing on european languages first, hence it'd be
> useful to be able to refer to any uppercase unicode character
> instead of just the typical [A-Z], which doesn't
I'm trying to make a unicode friendly regexp to grab sentences
reasonably reliably for as many unicode languages as possible, focusing
on european languages first, hence it'd be useful to be able to refer
to any uppercase unicode character instead of just the typical [A-Z],
which doesn't include, f
I'm trying to make a unicode friendly regexp to grab sentences
reasonably reliably for as many unicode languages as possible, focusing
on european languages first, hence it'd be useful to be able to refer
to any uppercase unicode character instead of just the typical [A-Z],
which doesn't include, f