Hi,

One of my packages (libboost-dev) has a lot of documentation as HTML
files, complete with inlined graphics and the like.  However, these
files are all mixed in with the source code.

I need a maintainable way to get a list of these files.

There must be some tool that will parse a set of html files
(recursively for all relative links) and give me back a list of files
linked to by <a href=..> and <img> and whatnot.  In short: I need
a list of all the files that make up the documentation, starting
from "index.html".

Suggestions?

Thanks,
-S

-- 
by Rocket to the Moon,
by Airplane to the Rocket,
by Taxi to the Airport,
by Frontdoor to the Taxi,
by throwing back the blanket and laying down the legs ...
- They Might Be Giants


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]


Reply via email to