That definitely turned me on to lxml as well. I've been running with BeautifulSoup but sometimes it chokes and lxml seems to play better.
I was also reading about a BeautifulSoup wrapper for lxml but have not gotten into it much as I was working on the first part of this with twill tonight. Perhaps tomorrow I can have it parsing out the data I need. :0) On Nov 11, 7:47 pm, Richard <richar...@gmail.com> wrote: > that blog post got me interested in lxml, which is a very efficient > library with a robust HTML parser. > > On Nov 12, 10:51 am, David <digitalcry...@gmail.com> wrote: > > > what DOM parser do you recommend or do you know is available? > > > <snip> > > > In my search for answers I happened across this link. > > >http://blog.ianbicking.org/2008/03/30/python-html-parser-performance/ > > > - David > > > On Nov 11, 4:52 pm, Wes James <compte...@gmail.com> wrote: > > > > On Wed, Nov 11, 2009 at 3:48 PM, Dmitri Zagidulin <dzagidu...@gmail.com> > > > wrote: > > > > <snip> > > > > > # Go to a url > > > > go('http://en.wikipedia.org/wiki/Web2py') > > > > > # Use formvalue() and submit() functions to log in > > > > > xhtml = show() # Capture the contents of the html page in a variable > > > > > # Send the variable to a DOM parser, or use regexps, or whatever you > > > > like > > > > what DOM parser do you recommend or do you know is available? > > > > <snip> > > > > -wes > > --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "web2py-users" group. To post to this group, send email to web2py@googlegroups.com To unsubscribe from this group, send email to web2py+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/web2py?hl=en -~----------~----~----~----~------~----~------~--~---