Same with sites that have negligible daily changes (like today's date
dynamically inserted), or random changes (a random quote, tip, stock quote,
product, image, etc etc would all screw that up).

Justin


on 26/09/02 11:03 PM, Marek Kilimajer ([EMAIL PROTECTED]) wrote:

> Hope the sites have no banners :),  they change all the time
> 
> John Holmes wrote:
> 
>> You could cache/save the actual contents of the file, then when you read
>> it next time, compare it to what you saved and see if it changed. You
>> may want to filter out everything but what's between <body> and </body>,
>> so you're not thinking it changed just b/c of something in the
>> headers...
>> 
>> ---John Holmes...
>> 
>> 
>> 
>>> -----Original Message-----
>>> From: Vikram Vaswani [mailto:[EMAIL PROTECTED]]
>>> Sent: Thursday, September 26, 2002 7:04 AM
>>> To: [EMAIL PROTECTED]
>>> Subject: [PHP] Finding out when a Web page has changed
>>> 
>>> Hi all,
>>> 
>>> I need to write an application that accepts a list of URLs and checks
>>> 
>>> 
>> them
>> 
>> 
>>> on a daily basis (via cron) to see if the pages have changed in the
>>> 
>>> 
>> past
>> 
>> 
>>> day.
>>> 
>>> I need some help with this. Does anyone know the most optimal way to
>>> 
>>> 
>> find
>> 
>> 
>>> out when a particular Web page has been modified? I am thinking about
>>> using
>>> the Last-Modified: HTTP header - however, all servers do not return
>>> 
>>> 
>> this
>> 
>> 
>>> header - any ideas on what the fallback should be?
>>> 
>>> TIA,
>>> 
>>> Vikram
>>> --
>>> "I find your lack of faith disturbing."
>>> --Darth Vader
>>> 
>>> --
>>> PHP General Mailing List (http://www.php.net/)
>>> To unsubscribe, visit: http://www.php.net/unsub.php
>>> 
>>> 
>> 
>> 
>> 
>> 
>> 
> 


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to