On 22 December 2011 20:35, Andy Wingo <wi...@pobox.com> wrote:
>> Not sure precisely what you mean here.  Is it something like:
>>
>>   (or (false-if-exception (parse-date str))
>>       (and (memq str '("0" "-1")) str)
>>       date-in-the-past)
>
> More like:
>
>   (if (member str '("0" "-1"))
>       date-in-the-past
>       (parse-date str))
>
> Then we can wait and see -- if only these two values are out there, then
> we are good, and we keep the "validating" characteristic of our date
> parser.  Otherwise we can fall back to the false-if-exception dance if
> someone submits a bug report.

A rough check against ~2600 sites scraped from dmoz.org shows only a
handful with other values.  These two:

  "Mon, 12 Jul 1996 1:00:00 GMT"
                    ^ misses leading `0'
  "Thu, 01 Jan 1970 00:00:00 +0000"
                             ^ should be `GMT'

The second (use of `+0000') was also encountered amongst other
date-valued headers in ~1% of pages sampled.  There might be a case
here for relaxing `parse-date' as I don't think these should be
handled specifically for "Expires" headers.

There were three more like:

  "{ts '2011-12-27 08:12:22'}"

which only appeared for "Expires" headers.  They look something like
server directives which should have been transformed to legit
expiration dates but haven't been, due to misconfiguration.  In this
case I'd rather throw an error than parse it (wrongly) to
date-in-the-past.

Given those points, I have attached a patch implementing the suggested
handling for "Expires" and will take a look at perhaps relaxing
parse-date (and others).  Anyone have ideas on that?

Daniel

From 8b7eda0bd7b03467f6eef0ce6c99dedf8fd3ac0c Mon Sep 17 00:00:00 2001
From: Daniel Hartwig <mand...@gmail.com>
Date: Tue, 27 Dec 2011 22:24:28 +0800
Subject: [PATCH] permit non-date values for Expires header

* module/web/http.scm ("Expires"): Permit (some) non-date values.
---
 module/web/http.scm |   10 +++++++++-
 1 files changed, 9 insertions(+), 1 deletions(-)

diff --git a/module/web/http.scm b/module/web/http.scm
index afe70a7..9bb4449 100644
--- a/module/web/http.scm
+++ b/module/web/http.scm
@@ -1506,7 +1506,15 @@ phrase\"."
 
 ;; Expires = HTTP-date
 ;;
-(declare-date-header! "Expires")
+(define *date-in-the-past* (parse-date "Thu, 01 Jan 1970 00:00:00 GMT"))
+
+(declare-header! "Expires"
+  (lambda (str)
+    (if (member str '("0" "-1"))
+        *date-in-the-past*
+        (parse-date str)))
+  date?
+  write-date)
 
 ;; Last-Modified = HTTP-date
 ;;
-- 
1.7.5.4