Re: search an entire website given the homepage URL

2006-04-25 Thread Fredrik Lundh
"Bell, Kevin" wrote: > I would like some feedback about my actual intention though, which is to > scrape local newspaper websites for the names of people that I work > with. Twice this month, colleagues have unknowingly been in the > newspaper, and only became aware of it because someone stumbled

Re: search an entire website given the homepage URL

2006-04-25 Thread Fredrik Lundh
"Bell, Kevin" wrote: > Well I certainly am learning a lot. I never said I intended to download > anyone's entire website, as was assumed, but it's been fun to see how > folks feel about it anyway! never? "If I want to search through the entire python website give[n] the homepage, how wo

Re: search an entire website given the homepage URL

2006-04-25 Thread Terry Reedy
"Bell, Kevin" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED] > I would like some feedback about my actual intention though, which is to > scrape local newspaper websites for the names of people that I work > with. Twice this month, colleagues have unknowingly been in the > newspaper

Re: search an entire website given the homepage URL

2006-04-25 Thread Bell, Kevin
Fredrik wrote: to grab entire sites ? try doing that on a commercial data provider's site, and chances are that you'll end up being banned (or sued) within hours ... - Me: Nope, I never said that to start with... Well I certainly am learning a lot. I never said I intended to downlo

Re: search an entire website given the homepage URL

2006-04-25 Thread Steve Holden
Bell, Kevin wrote: >>use a search engine (try the search box in the upper right corner). > > >>using a spider to download the entire site just so you can "search > > through >it" is bloody impolite. > > Really? I'd argue that's impolite only if you're an impolite person > with a rude agenda, w

Re: search an entire website given the homepage URL

2006-04-25 Thread Fredrik Lundh
"Bell, Kevin" wrote: > >use a search engine (try the search box in the upper right corner). > > >using a spider to download the entire site just so you can "search > through >it" is bloody impolite. > > Really? I'd argue that's impolite only if you're an impolite person > with a rude agenda, whic

Re: search an entire website given the homepage URL

2006-04-25 Thread Bell, Kevin
>use a search engine (try the search box in the upper right corner). >using a spider to download the entire site just so you can "search through >it" is bloody impolite. Really? I'd argue that's impolite only if you're an impolite person with a rude agenda, which is not what I had in mind, but t

Re: search an entire website given the homepage URL

2006-04-25 Thread Fredrik Lundh
"Bell, Kevin" wrote: > I know I can use urllib2 to get at a website given urllib2.urlopen(url) > but I'm unsure how to then go through all pages that are linked to it, > but still in the domain. If I want to search through the entire python > website give the homepage, how would I go about it? u

search an entire website given the homepage URL

2006-04-25 Thread Bell, Kevin
I know I can use urllib2 to get at a website given urllib2.urlopen(url) but I'm unsure how to then go through all pages that are linked to it, but still in the domain. If I want to search through the entire python website give the homepage, how would I go about it? I don't reinvent the wheel if s