I think you guys are missing the point. All you would need to add to get a 'probable match' is add another search that goes through the 10% that didnt get matched and do a "endswith" search on the data. From the example data you showed me, that would match a good 90% of the 10%, leaving you with a 1% that must be hand matched. You would have to combine this idea with Jeff Shannon's idea to make it work more efficiently.
-- http://mail.python.org/mailman/listinfo/python-list