On 13/08/2014 7:28 AM, Roy Smith wrote:
Second, if you're going to be parsing web pages, trying to use regexes
is a losing game. You need something that knows how to parse HTML. The
canonical answer is lxml (http://lxml.de/), but Beautiful Soup
(http://www.crummy.com/software/BeautifulSoup/) is
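
To illustrate the point about using a real parser rather than regexes, here is a minimal sketch. It is not code from this thread. To keep it runnable without extra installs it uses the standard library's html.parser (Python 3) in place of lxml or Beautiful Soup, which are the tools actually recommended above. The sample markup, the class names "horse" and "odds", and the helper names CellCollector and extract_cells are all made up for illustration; a real page will be structured differently.

```python
from html.parser import HTMLParser

# Made-up sample markup (an assumption); a real page would be fetched
# over HTTP and would almost certainly use different tags and classes.
SAMPLE_HTML = """
<table>
  <tr><td class="horse">Example Runner</td><td class="odds">5/1</td></tr>
  <tr><td class="horse">Another Runner</td><td class="odds">7/2</td></tr>
</table>
"""


class CellCollector(HTMLParser):
    """Hypothetical helper: collect the text of each <td> with a given class."""

    def __init__(self, wanted_class):
        super().__init__()
        self.wanted_class = wanted_class
        self.in_cell = False   # True while inside a matching <td>
        self.cells = []        # text gathered from each matching cell

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs
        if tag == "td" and dict(attrs).get("class") == self.wanted_class:
            self.in_cell = True
            self.cells.append("")

    def handle_endtag(self, tag):
        if tag == "td":
            self.in_cell = False

    def handle_data(self, data):
        # Text may arrive in several chunks, so append to the current cell
        if self.in_cell:
            self.cells[-1] += data


def extract_cells(html, wanted_class):
    """Return the stripped text of every <td class=wanted_class> in html."""
    parser = CellCollector(wanted_class)
    parser.feed(html)
    parser.close()
    return [cell.strip() for cell in parser.cells]


if __name__ == "__main__":
    print(extract_cells(SAMPLE_HTML, "horse"))
    print(extract_cells(SAMPLE_HTML, "odds"))
```

The parser tracks where it is in the document instead of pattern-matching raw text, so changes in whitespace or attribute order that tend to break a regex do not affect it. lxml and Beautiful Soup offer the same idea with far less boilerplate.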
On Tue, 12 Aug 2014 13:00:30 -0700, Simon Evans wrote:
> in accessing from the 'Racing Post' on a daily basis. Anyhow, the code
Following is some starter code. You will have to look at the output,
compare it to the web page, and work out how you want to process it
further. Note that I use beaut
On Tue, 12 Aug 2014 15:44:58 -0700 (PDT), Simon Evans wrote:
[snip]
> Dear Programmers, Thank you for your responses. I have installed
> 'Beautiful Soup' and I have the 'Getting Started in Beautiful Soup'
> book, but can't seem to make any progress with it; I am too thick to
> make much use of it.
In article <53eaab7d$0$29979$c3e8da3$54964...@news.astraweb.com>,
Steven D'Aprano wrote:
> By studying how other scraping programs work, and studying how your racing
> pages store data, you should be able to put the two together and see how to
> get the data you want.
It's also worth mentioning
Simon Evans wrote:
> Dear Programmers, Thank you for your responses. I have installed
> 'Beautiful Soup' and I have the 'Getting Started in Beautiful Soup' book,
> but can't seem to make any progress with it; I am too thick to make much
> use of it. I was hoping I could scrape specified stuff off
On Tuesday, August 12, 2014 9:00:30 PM UTC+1, Simon Evans wrote:
> Dear Programmers,
>
> I have been looking at the YouTube 'Web Scraping Tutorials' of Chris Reeves.
> I have tried a few of his Python programs in the Python27 command prompt, but
> altered them from accessing data using links sa
In article ,
Simon Evans wrote:
> Dear Programmers,
> I have been looking at the YouTube 'Web Scraping Tutorials' of Chris Reeves.
> I have tried a few of his Python programs in the Python27 command prompt, but
> altered them from accessing data using links say from the Dow Jones index, to
>
On Tue, 12 Aug 2014 13:00:30 -0700 (PDT)
Simon Evans wrote:
> Dear Programmers,
> I have been looking at the YouTube 'Web Scraping Tutorials' of Chris Reeves.
> I have tried a few of his Python programs in the Python27 command prompt, but
> altered them from accessing data using links say from