On Mar 30, 12:20 pm, "jd" <[EMAIL PROTECTED]> wrote:
> I am looking for python code (working or sample code) that can take an
> html document created by Microsoft Word and clean it up (if you've
> never had to look at a Word-generated html document, consider yourself
> lucky ;-) Alternatively, if you know of a non-python solution, I'd
> like to hear about it.
>
> Thanks...
>
> -- jeff
You could try Beautiful Soup:
http://www.crummy.com/software/BeautifulSoup/documentation.html

Python is well suited to parsing HTML/XML, so searching for "Python HTML parsing" should turn up other options as well.

Mike
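
For a rough idea of what such a cleanup can look like, here is a minimal sketch. It uses the standard library's html.parser rather than Beautiful Soup, purely so that it runs without installing anything. The class name, the function name, and the choice of what to strip (class/style/lang attributes, span and font wrappers, namespaced tags such as o:p, style blocks, and comments) are all assumptions for illustration, not a complete description of what Word emits.

```python
from html import escape
from html.parser import HTMLParser

# Hypothetical cleanup policy, chosen for illustration only.
DROP_ATTRS = {"class", "style", "lang"}   # attributes removed from kept tags
UNWRAP_TAGS = {"span", "font"}            # tag removed, inner text kept
SKIP_TAGS = {"style", "xml"}              # tag and its contents removed


class WordHTMLCleaner(HTMLParser):
    """Re-emit HTML while dropping common Word-specific markup."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []          # pieces of cleaned output
        self.skip_depth = 0    # > 0 while inside a SKIP_TAGS element

    @staticmethod
    def _is_noise(tag):
        # Namespaced tags (e.g. o:p, v:shape) and pure wrappers are unwrapped.
        return ":" in tag or tag in UNWRAP_TAGS

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
            return
        if self.skip_depth or self._is_noise(tag):
            return
        parts = [tag]
        for name, value in attrs:
            if name in DROP_ATTRS or ":" in name:
                continue
            if value is None:
                parts.append(name)
            else:
                parts.append(f'{name}="{escape(value, quote=True)}"')
        self.out.append("<" + " ".join(parts) + ">")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/>: emit once, never enter skip mode.
        if tag not in SKIP_TAGS:
            self.handle_starttag(tag, attrs)

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth or self._is_noise(tag):
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        # Text arrives unescaped (convert_charrefs=True), so re-escape it.
        if not self.skip_depth:
            self.out.append(escape(data, quote=False))

    # handle_comment is not overridden, so comments (including Word's
    # <!--[if gte mso 9]> ... <![endif]--> blocks) are silently dropped.


def clean_word_html(html):
    parser = WordHTMLCleaner()
    parser.feed(html)
    parser.close()  # flush any buffered trailing text
    return "".join(parser.out)


if __name__ == "__main__":
    dirty = (
        '<p class="MsoNormal" style="margin:0in"><span lang="EN-US">Hello '
        "<b>world</b><o:p></o:p></span></p>"
        "<!--[if gte mso 9]><xml>x</xml><![endif]-->"
    )
    print(clean_word_html(dirty))  # <p>Hello <b>world</b></p>
```

Real Word output is messier than this sample (nested tables, list markup, odd character encodings), so treat the sets at the top as a starting point to adjust against your own documents.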