There's more to re than just sub. How about: sanesplit = re.split(r"</td><td>|<td>|</td>", text) date = sanesplit[1] times = times = [time for time in sanesplit if re.match("\d\d:\d\d", time)]
... then "date" contains the date at the beginning of the line and "times" contains all your times. -- http://mail.python.org/mailman/listinfo/python-list