Phillip B Oldham wrote:
> I'm going to throw together a quick project over the weekend: a
> spider. I want to scan a website for certain elements.
> 
> I come from a PHP background, so normally I'd:
>  - throw together a quick REST script to handle http request/responses

Use the urllib/urllib2 modules in the stdlib for GET/POST requests with
parameters, or let lxml fetch the page itself for simple page requests.
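
A minimal sketch of the urllib route, assuming Python 3, where urllib2 was merged into urllib.request. The helper names build_request and fetch and the example.com URLs are made up for illustration; they are not part of any library.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_request(url, params=None, form=None):
    """Return a Request: GET with a query string, or POST if form data is given.

    Hypothetical helper for illustration only.
    """
    if params:
        # Append the encoded parameters as a query string.
        separator = "&" if "?" in url else "?"
        url = url + separator + urlencode(params)
    # A non-None data payload makes urllib issue a POST instead of a GET.
    data = urlencode(form).encode("ascii") if form else None
    return Request(url, data=data)


def fetch(url, params=None, form=None, timeout=10):
    """Send the request and return the raw response body as bytes."""
    with urlopen(build_request(url, params, form), timeout=timeout) as response:
        return response.read()
```

Something like fetch("http://example.com/search", params={"q": "spider"}) would then return the page body, ready to hand to a parser.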


>  - load the pages into a simplexml object and
>  - run an xpath over the dom to find the nodes I need to test

Use lxml.

http://codespeak.net/lxml/
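
As a rough sketch of the "parse, then XPath" step with lxml.html: the sample markup, the find_links helper and the XPath expression below are assumptions for illustration, so adapt them to the elements you actually need to test.

```python
from lxml import html

# Made-up sample page standing in for a fetched document.
PAGE = """
<html><body>
  <div id="content">
    <a href="/one.html">One</a>
    <a href="/two.html" class="external">Two</a>
  </div>
</body></html>
"""


def find_links(markup, base_url):
    """Return absolute hrefs of all links inside the div with id 'content'."""
    doc = html.fromstring(markup)
    # Resolve relative hrefs against the page URL so a spider can follow them.
    doc.make_links_absolute(base_url)
    # XPath attribute selection returns string-like results; convert to str.
    return [str(href) for href in doc.xpath('//div[@id="content"]//a/@href')]


links = find_links(PAGE, "http://example.com/")
```

For a live page, html.parse() also accepts a filename or URL and returns a tree whose getroot() element supports the same xpath() call.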

Stefan
--
http://mail.python.org/mailman/listinfo/python-list
