Jeffrey C. Jacobs <[EMAIL PROTECTED]> added the comment:
Quick update (see thread 2636 for other updates to the Regex
functionality): I do have a design in my mind for how to accomplish both
Atomic Grouping and Possessive Qualifiers and it would work with either
the existing Engine design
Jeffrey C. Jacobs <[EMAIL PROTECTED]> added the comment:
Thank you and Merci Antoine!
That is a good point. It is clearly specific to the compiler whether a
switch-case will be turned into a series of conditional branches or
simply creating an internal jump table with lookup. And it i
Jeffrey C. Jacobs <[EMAIL PROTECTED]> added the comment:
I am making my changes in a Bazaar branch hosted on Launchpad. It took
me quite a while to get things set up more-or-less logically but there
they are and I'm currently trying to re-apply my local changes up to
today into
Jeffrey C. Jacobs added the comment:
Matthew, I think that is considered a single word in Sanscrit or Thai so Python
3.x is correct. In this case you've written the Sanscrit word for Hindi.
--
___
Python tracker
<http://bugs.py
Jeffrey C. Jacobs added the comment:
Maybe you could show us the byte-for-byte hex of the string you're testing so
we can examine if it's really a code point intending word boundary or just a
code point for the sake of beginning a new
Jeffrey C. Jacobs added the comment:
Thanks Matthew and sorry to put you through more work; I just wanted to verify
exactly which unicode (UTF-16 I take it) were being used to verify if the
UNICODE standard expected them to be treated as unique words or single letters
within a word. Sanskrit
Jeffrey C. Jacobs added the comment:
Re: timings
Thanks for the info, John. First of all, I really like those tests and
could you please submit a patch or other document so that we could
combine them into the python test suite.
The python test suite, which can be run as part of 'make
Jeffrey C. Jacobs added the comment:
Mea culpa et mes apologies,
The '-s' option to John's expressions are indeed executed only once --
they are one-time setup lines. The final quoted expression is what's
run multiple times.
In other words, improving caching in regex w
Jeffrey C. Jacobs added the comment:
Okay, as I said, Atomic Grouping, etc., off a recent 2.6 is already
available and I can do any cleanups requested to those already
mentioned, I just don't want to start any new items at the moment. As
it is, we are still over a year from any of
Jeffrey C. Jacobs added the comment:
Thanks, Antione! Then I think for the most part any changes to Regexp
will have to wait for 3.2 / 2.7.
--
message_count: 71.0 -> 72.0
___
Python tracker
<http://bugs.python.org/iss
Jeffrey C. Jacobs added the comment:
My only addition opinion is that re is very much used in deployed python
applications and was written not just for correctness but also speed. As such,
regex should be benchmarked fairly to show that it is commensurately speedy. I
wouldn'
Jeffrey C. Jacobs added the comment:
What about a regex flag? Like regex.W or (?w)?
--
___
Python tracker
<http://bugs.python.org/issue2636>
___
___
Python-bug
Changes by Jeffrey C. Jacobs :
--
nosy: +timehorse
___
Python tracker
<http://bugs.python.org/issue17980>
___
___
Python-bugs-list mailing list
Unsubscribe:
Jeffrey C. Jacobs added the comment:
If I recall, I started this thread with a plan to update re itself with
implementations of various features listed in the top post. If you look at the
list of files uploaded by me there are seme complete patches for Re to add
various features like Atomic
101 - 114 of 114 matches
Mail list logo