On Tue, 2013-01-01 at 20:01 +0000, Ken Moffat wrote:
> On Tue, Jan 01, 2013 at 01:21:26PM -0600, Gerard Beekmans wrote:
> > Hi guys,
> > 
> > After reviewing logs I ended up having to block the wget user agent in 
> > Apache for the time being. Pages such as 
> > http://www.linuxfromscratch.org/lfs/downloads/stable/ are causing issues 
> > with wget.
> > 
> > The name, last modified, size and description headers are clickable 
> > links to change the sorting of the page. This tricks wget's recursive 
> > mode into thinking they are different pages.
> 
> Is this something we can change in the future, somewhere in the XML,
> or is it another of those "we miss Manuel" moments?

Nope, that page is served out by Apache using its autoindex module.

Gerard, we could just configure Apache to use 'SuppressColumnSorting'
(http://httpd.apache.org/docs/2.2/mod/mod_autoindex.html#indexoptions). It
won't stop bots from downloading masses of data if that's what they're intent
on doing, but for otherwise innocent scripts that are being tripped up by the
column-sorting hyperlinks, it'll prevent them from getting multiple copies of
everything.
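
As a rough sketch, it would be something like the following. The directory
path is only a placeholder of mine, not the real layout on the server, and the
block would need to go wherever the downloads tree is actually configured:

```apache
# Hypothetical path; substitute the directory that serves the downloads tree.
<Directory "/srv/www/example/downloads">
    # Keep the generated listing, but stop mod_autoindex from turning the
    # column headers into sort links.
    IndexOptions +SuppressColumnSorting
</Directory>
```

With the "+" prefix the option is merged into whatever IndexOptions are
already in effect rather than replacing them.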

Regards,

Matt.

-- 
http://linuxfromscratch.org/mailman/listinfo/lfs-dev
FAQ: http://www.linuxfromscratch.org/faq/
Unsubscribe: See the above information page
