On 11 Sep 2007 at 16:09, Chas Owens wrote:
> On 9/11/07, Jenda Krynicky <[EMAIL PROTECTED]> wrote:
> > On 11 Sep 2007 at 15:15, Srinivas wrote:
> > > I want to write a perl script that scrapes various job sites like
> > > monster, dice, career builders etc.
> > >
> > > Given the job id and web site name it should scrape the
> > > information and store in a mySQL database.
> >
> > And are you sure they won't mind? I don't work there anymore, but
> > still ... you should make sure what you plan to do is OK with them.
> snip
> 
> The easiest way to do this is to obey their robots.txt file.  You can
> learn more about robots.txt here:
> http://www.robotstxt.org/wc/faq.html.  Also, be careful, the text you
> are copying is still copyrighted and you cannot republish more than a
> snippet without running into potential legal hazards.

I don't think that's enough. It's one thing to index a site for searching 
(think ... Google) and another 
to scrape the data and present it elsewhere as yours. The fact that it's OK to 
run a script to 
download some data doesn't mean all uses of said data are all right.

Jenda
=============== [EMAIL PROTECTED] ==================
Always code as if the guy who ends up maintaining your code will be a
violent psychopath who knows where you live.
      -- Rick Osborne, <[EMAIL PROTECTED]>



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/


Reply via email to