Re: Fetching a clean copy of a changing web page

2007-07-17 Thread star . public
On Jul 16, 4:50 am, Stefan Behnel <[EMAIL PROTECTED]> wrote: > Diez B. Roggisch wrote: > > John Nagle wrote: > >>I'm reading the PhishTank XML file of active phishing sites, > >> at "http://data.phishtank.com/data/online-valid/"; This changes > >> frequently, and it's big (about 10MB right now

Re: Fetching a clean copy of a changing web page

2007-07-16 Thread Carsten Haese
On Tue, 2007-07-17 at 00:47 +, John Nagle wrote: > Miles wrote: > > On Jul 16, 1:00 am, John Nagle <[EMAIL PROTECTED]> wrote: > > > >>I'm reading the PhishTank XML file of active phishing sites, > >>at "http://data.phishtank.com/data/online-valid/"; This changes > >>frequently, and it's b

Re: Fetching a clean copy of a changing web page

2007-07-16 Thread Steve Holden
John Nagle wrote: > Miles wrote: >> On Jul 16, 1:00 am, John Nagle <[EMAIL PROTECTED]> wrote: >> >>>I'm reading the PhishTank XML file of active phishing sites, >>> at "http://data.phishtank.com/data/online-valid/"; This changes >>> frequently, and it's big (about 10MB right now) and on a busy

Re: Fetching a clean copy of a changing web page

2007-07-16 Thread John Nagle
Miles wrote: > On Jul 16, 1:00 am, John Nagle <[EMAIL PROTECTED]> wrote: > >>I'm reading the PhishTank XML file of active phishing sites, >>at "http://data.phishtank.com/data/online-valid/"; This changes >>frequently, and it's big (about 10MB right now) and on a busy server. >>So once in a wh

Re: Fetching a clean copy of a changing web page

2007-07-16 Thread John Nagle
Miles wrote: > On Jul 16, 1:00 am, John Nagle <[EMAIL PROTECTED]> wrote: > >>I'm reading the PhishTank XML file of active phishing sites, >>at "http://data.phishtank.com/data/online-valid/"; This changes >>frequently, and it's big (about 10MB right now) and on a busy server. >>So once in a wh

Re: Fetching a clean copy of a changing web page

2007-07-16 Thread Stefan Behnel
Diez B. Roggisch wrote: > John Nagle wrote: >>I'm reading the PhishTank XML file of active phishing sites, >> at "http://data.phishtank.com/data/online-valid/"; This changes >> frequently, and it's big (about 10MB right now) and on a busy server. >> So once in a while I get a bogus copy of the

Re: Fetching a clean copy of a changing web page

2007-07-16 Thread Amit Khemka
On 7/16/07, John Nagle <[EMAIL PROTECTED]> wrote: > I'm reading the PhishTank XML file of active phishing sites, > at "http://data.phishtank.com/data/online-valid/"; This changes > frequently, and it's big (about 10MB right now) and on a busy server. > So once in a while I get a bogus copy of

Re: Fetching a clean copy of a changing web page

2007-07-15 Thread Miles
On Jul 16, 1:00 am, John Nagle <[EMAIL PROTECTED]> wrote: > I'm reading the PhishTank XML file of active phishing sites, > at "http://data.phishtank.com/data/online-valid/"; This changes > frequently, and it's big (about 10MB right now) and on a busy server. > So once in a while I get a bogus

Re: Fetching a clean copy of a changing web page

2007-07-15 Thread Diez B. Roggisch
John Nagle schrieb: >I'm reading the PhishTank XML file of active phishing sites, > at "http://data.phishtank.com/data/online-valid/"; This changes > frequently, and it's big (about 10MB right now) and on a busy server. > So once in a while I get a bogus copy of the file because the file > was

Fetching a clean copy of a changing web page

2007-07-15 Thread John Nagle
I'm reading the PhishTank XML file of active phishing sites, at "http://data.phishtank.com/data/online-valid/"; This changes frequently, and it's big (about 10MB right now) and on a busy server. So once in a while I get a bogus copy of the file because the file was rewritten while being sent b