On 13 June 2010 23:56, Gerald Pfeifer wrote: > On Mon, 14 Jun 2010, Gerald Pfeifer wrote: >>>> Thanks, see /home/gccadmin/scripts/update_web_docs_libstdcxx_svn >> The script places up-to-date .html.gz files in the web infrastructure, >> alas we do have older .html there -- and our web server is happy to >> serve those over the compressed ones. >> >> In any case, we surely do not want to have _inconsistent_ versions of >> the same contents there. > > Any objection to me running the following > > find . -name '*.html' | while read f; do > g="$f.gz" > if [ -e "$g" ] && [ "$f" -ot "$g" ]; then > echo "$f is older than $g and should be removed." > fi > done > > where the echo is replaced by an "rm $f"? > > Currently that has 348 hits in /www/gcc/htdocs/onlinedocs/libstdc++.
That seems like the right thing to do. I think it will leave some stale files in place, because the output of generating the html docs creates different filenames sometimes, so there won't be a new .gz for all the old files. Those stale files shouldn't be reachable from any fresh pages. If you could send me a list of all the files remaining under that dir after your command, I will let you know which can safely be removed because they are stale and have been replaced by a new file with a different name.