Re: python screen scraping/parsing

2008-06-14 Thread Larry Bates
Dan Stromberg wrote: BeautifulSoup is a pretty nice python module for screen scraping (not necessarily well formed) web pages. On Fri, 13 Jun 2008 11:10:09 -0700, bruce wrote: Hi... got a short test app that i'm playing with. the goal is to get data off the page in question. basically, i sh

Re: python screen scraping/parsing

2008-06-13 Thread Paul Boddie
On 13 Jun, 23:09, "bruce" <[EMAIL PROTECTED]> wrote: > > Thanks for the reply. Came to the same conclusion a few minutes before I saw > your email. > > Another question: > > tr=d.xpath(foo) > > gets me an array of nodes. > > is there a way for me to then iterate through the node tr[x] to see if a >

RE: python screen scraping/parsing

2008-06-13 Thread bruce
PM To: python-list@python.org Subject: Re: python screen scraping/parsing On 13 Jun, 20:10, "bruce" <[EMAIL PROTECTED]> wrote: > > url ="http://www.pricegrabber.com/rating_summary.php/page=1"; [...] > tr = > "/html/body/[EMAIL PROTECTED]

Re: python screen scraping/parsing

2008-06-13 Thread Paul Boddie
On 13 Jun, 20:10, "bruce" <[EMAIL PROTECTED]> wrote: > > url ="http://www.pricegrabber.com/rating_summary.php/page=1"; [...] > tr = > "/html/body/[EMAIL PROTECTED]'pgSiteContainer']/[EMAIL PROTECTED]'pgPageContent']/table[2]/tbody/tr[4]" > > tr_=d.xpath(tr) [...] > my issu

Re: python screen scraping/parsing

2008-06-13 Thread Dan Stromberg
BeautifulSoup is a pretty nice python module for screen scraping (not necessarily well formed) web pages. On Fri, 13 Jun 2008 11:10:09 -0700, bruce wrote: > Hi... > > got a short test app that i'm playing with. the goal is to get data off > the page in question. > > basically, i should be abl

python screen scraping/parsing

2008-06-13 Thread bruce
Hi... got a short test app that i'm playing with. the goal is to get data off the page in question. basically, i should be able to get a list of "tr" nodes, and then to iterate/parse them. i'm missing something, as i think i can get a single node, but i can't figure out how to display the content
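
For readers of this thread, here is a minimal sketch of the "get a list of tr nodes, then iterate over them and show their content" step described above. It is not the code from the original post: it uses the standard library's xml.etree.ElementTree in place of whatever module supplied d.xpath(), and the SAMPLE markup and the row_texts helper are made up for illustration. ElementTree only accepts well-formed markup, so a real page would probably need a more forgiving parser first, such as the BeautifulSoup module suggested elsewhere in the thread.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample markup (assumption: the real page
# discussed in the thread is not reproduced here).
SAMPLE = """
<html><body>
  <table>
    <tr><td>Store A</td><td>4.5</td></tr>
    <tr><td>Store B</td><td>3.0</td></tr>
  </table>
</body></html>
"""


def row_texts(markup):
    """Return one list of cell strings per <tr> element found in markup."""
    root = ET.fromstring(markup.strip())
    rows = []
    # findall() returns an ordinary Python list of Element nodes,
    # so it can be indexed or looped over like any other list.
    for tr in root.findall(".//tr"):
        # itertext() yields every piece of text nested under the cell.
        cells = ["".join(td.itertext()).strip() for td in tr.findall("td")]
        rows.append(cells)
    return rows


for cells in row_texts(SAMPLE):
    print(cells)
```

The point of the sketch is that the query result is a list of nodes, and each node can be queried again (here with tr.findall("td")) to reach its children and their text.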