TheFlyingDutchman wrote: > On Sep 20, 8:04 pm, crybaby <[EMAIL PROTECTED]> wrote: >> I need to traverse a html page with big table that has many row and >> columns. For example, how to go 35th td tag and do regex to retireve >> the content. After that is done, you move down to 15th td tag from >> 35th tag (35+15) and do regex to retrieve the content? > > Make the file an xhtml file (valid xml) if it isn't already and then > you can use software written to process XML files: > > http://pyxml.sourceforge.net/topics/
... or just use software that can process XML and HTML the same way *and* that supports XPath and tree iteration so that you can easily select the content you want. http://codespeak.net/lxml/ Stefan -- http://mail.python.org/mailman/listinfo/python-list