On Sat, May 05, 2007 at 08:52:15AM -0700, Wiseman wrote: > > I believe the current Python re module was written to replace the Python > > wrapping of pcre in order to support unicode. > > I don't know how PCRE was back then, but right now it supports UTF-8 > Unicode patterns and strings, and Unicode character properties. Maybe > it could be reintroduced into Python?
I would say this is a case for "rough consensus and working code". With something as big and ugly[1] as a regexp library, I think the "working code" part will be the hard part. So, if you have a patch, there's a decent chance such a thing would be adopted. I'm not sure what your skill level is, but I would suggest studying the code, starting in on a patch for one or more of these features, and then corresponding with the module's maintainers to improve your patch to the point where it can be accepted. Dustin -- http://mail.python.org/mailman/listinfo/python-list