Many of the links off of the main page are ajax.  I don't think that
wget can scrape that very easily.

On Oct 12, 11:10 am, Tassilo Horn <tass...@member.fsf.org> wrote:
> jingguo <yaojing...@gmail.com> writes:
>
> Hi!
>
> > When programming in Clojure, I use the Reference documentation on
> > clojure.org a lot. But my network condition is horrible. So I want to
> > make a off-line copy. I have tried "wget -x -m -khttp://clojure.org";
> > to make a mirror of clojure.org. But it does not work.
>
> First, I've thought it was due to a restrictive robots.txt on the clojure
> server, so I tried
>
>   wget -e robots=off -x -m -k --wait 1http://clojure.org/
>
> but that also downloads just the index page...
>
> But I guess for programming clojure, you mostly want only the API docs,
> and that can be mirrored perfectly fine with
>
>   wget -xmkhttp://clojure.github.com/clojure/
>
> HTH,
> Tassilo

-- 
You received this message because you are subscribed to the Google
Groups "Clojure" group.
To post to this group, send email to clojure@googlegroups.com
Note that posts from new members are moderated - please be patient with your 
first post.
To unsubscribe from this group, send email to
clojure+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/clojure?hl=en

Reply via email to