Re: How to print all expressions that match a regular expression

Tim Chase Sun, 07 Feb 2010 05:56:29 -0800

hzh...@gmail.com wrote:

"Given the function hashlib.sha256, enumerate all the possible inputs
that give the hexadecimal result
0a2591aaf3340ad92faecbc5908e74d04b51ee5d2deee78f089f1607570e2e91."


This is a hash collision problem. Nobody has proved that SHA-256 is
collision free

It's actually pretty easy to prove that it is *not* collisionfree. The SHA-256 encodes 512 bits of data. So the the processof encoding (2**512)+1 distinct inputs incurs a collision inSHA-256 space as soon as you've hit (2**512)+1 if not earlier.


to start you off:

  sha_backmap = {}
  for i in xrange((2**512)+2):
    hash = sha(str(i))
    if hash in sha_backmap:
      print "Collision found: %i and %i" % (
        i, sha_backmap[hash])
        break
    sha_backmap[hash] = i

Though it might take a computer the size of the universe, so I'mguessing that the first collision encountered is with "42". Ileave the actual calculation and hashing of all possiblecombinations of 513 bits of data as an exercise to the readerwith a lot of time on their hands or a quantum computer undertheir desk ;-)

It is hard to tell in advance. However, we can add some timing limit
or counting limit, to make it an algorithm, which can halt. For
example, whenever the program outputs more than 1000000 expressions
that match the input regex, we can halt because that exceeds our
limit. But surely this is not efficient because of the post-decision.

As mentioned, it sounds like you either want a depth-first of thesolution space that raises exceptions on an infinite/unboundedoperator ("*", "+", and "{N,}" as mentioned in another email), orif you want to handle those operators, do a breadth-first searchof the solution-space and track your depth (or time taken, orprevious number of multi-factor atoms if you desire) to ensureyou don't exceed a certain depth. But you're still talking acombinatorial number of solutions for even simple regexps.


-tkc


--
http://mail.python.org/mailman/listinfo/python-list

Re: How to print all expressions that match a regular expression

Reply via email to