On 04/09/2012 03:54, Roy Smith wrote:
> Let's assume you're testing two strings for equality. You've already
> done the obvious quick tests (i.e. they're the same length), and you're
> down to the O(n) part of comparing every character.
>
> I'm wondering if it might be faster to start at the ends of the strings
> instead of at the beginning? If the strings are indeed equal, it's the
> same amount of work starting from either end. But, if it turns out that
> for real-life situations, the ends of strings have more entropy than the
> beginnings, the odds are you'll discover that they're unequal more
> quickly by starting at the end.
From the rest of the thread, it looks like in most situations it won't
make much difference: when two strings are unequal, typically only a few
characters need to be compared before the mismatch is found.
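
A rough way to see this for yourself is with timeit (the strings below
are just made-up examples):

import timeit

# Mismatch only at the last character: == has to scan all 1000
# characters before it can return False.
print(timeit.timeit("a == b",
                    setup="a = 'x'*1000 + 'a'; b = 'x'*1000 + 'b'"))

# Mismatch at the first character: == bails out almost immediately.
print(timeit.timeit("a == b",
                    setup="a = 'a' + 'x'*1000; b = 'b' + 'x'*1000"))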
However, if you were in a situation with many strings which were almost
equal, the most general way to improve the situation might be to store a
hash of the string along with the string, i.e. store (hash(x), x) and
then compare equality of this tuple. Almost all of the time, if the
strings are unequal the hashes will be unequal too. Or, as someone else
suggested, use interned versions of the strings. This is basically the
same solution but even better. In this case, your startup costs will be
higher (interning each string as it is created) but your comparisons
will always be instant.
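
Here's a minimal sketch of both ideas (the strings and the tagged()
helper are just illustrative, not from any particular library):

from sys import intern  # on Python 2, intern() is a builtin

# Store a (hash, string) pair so that unequal strings almost always
# fail on the cheap integer comparison and never reach the O(n) scan.
def tagged(s):
    return (hash(s), s)

a = tagged("spam " * 200 + "variant A")
b = tagged("spam " * 200 + "variant B")
print(a == b)   # hashes differ, so the character-by-character scan is skipped

# Interning goes one step further: equal strings share a single object,
# so equality can succeed on an identity check.
x = intern("spam " * 200 + "shared tail")
y = intern("spam " * 200 + "shared tail")
print(x is y)   # True
print(x == y)   # also True, and effectively instant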
Dan