Simon Forman wrote: > That third option seems to work fine. Well it does, but there are still many things wrong with it
if len(tok) > 0: should be written as if(tok): tok = '' tok = toc + c should be written as tok = [] tok.append(c) and later ''.join(toc) anyway, the entire thing should be replaced with something like this: import re def breakLine(s): splitters = '?()&|:~,' chars = '^ \t\n\r\f\v%s' % splitters regex = '''( (?:[%s]) | (?:[%s]+))''' % (splitters, chars) return re.findall(regex, s,re.VERBOSE) That should be able to be simplified even more if one were to use the character lists built into the regex standard. -- - Justin -- http://mail.python.org/mailman/listinfo/python-list