Hi,

One of my packages (libboost-dev) has a lot of documentation as HTML files, complete with inlined graphics and the like. However, these files are all mixed in with the source code.

I need a maintainable way to get a list of these files. There must be some tool that will parse a set of HTML files (recursively, following all relative links) and give me back a list of the files linked to by <a href=...>, <img> and whatnot.

In short: I need a list of all the files that make up the documentation, starting from "index.html".

Suggestions?

Thanks,
-S

-- 
by Rocket to the Moon,
by Airplane to the Rocket,
by Taxi to the Airport,
by Frontdoor to the Taxi,
by throwing back the blanket and laying down the legs ...
- They Might Be Giants
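
P.S. To make the request concrete, here is a rough sketch in Python of the traversal I have in mind. It is not an existing tool: the names collect_doc_files, LinkCollector and LINK_ATTRS are made up for illustration, and the set of tags it follows is an assumption that is surely incomplete (it ignores CSS url(...) references, for instance). It only uses the Python standard library.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urlsplit, unquote

# Tag/attribute pairs treated as links (an assumed, non-exhaustive set).
LINK_ATTRS = {
    ("a", "href"), ("img", "src"), ("link", "href"),
    ("script", "src"), ("frame", "src"), ("iframe", "src"),
}


class LinkCollector(HTMLParser):
    """Collects the raw link targets found in one HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser hands us lowercased tag and attribute names.
        for name, value in attrs:
            if value and (tag, name) in LINK_ATTRS:
                self.links.append(value)


def collect_doc_files(start):
    """Return the sorted list of existing local files reachable from start."""
    seen = set()
    todo = [os.path.normpath(start)]
    while todo:
        path = todo.pop()
        if path in seen or not os.path.isfile(path):
            continue  # already visited, or a dangling/directory link
        seen.add(path)
        if not path.lower().endswith((".html", ".htm")):
            continue  # images, stylesheets etc. are listed but not parsed
        parser = LinkCollector()
        with open(path, encoding="utf-8", errors="replace") as fh:
            parser.feed(fh.read())
        for link in parser.links:
            parts = urlsplit(link)
            # Skip absolute URLs (http:, mailto:, //host/...) and bare #fragments.
            if parts.scheme or parts.netloc or not parts.path:
                continue
            # Skip site-absolute paths; they cannot be resolved in a source tree.
            if parts.path.startswith("/"):
                continue
            target = os.path.join(os.path.dirname(path), unquote(parts.path))
            todo.append(os.path.normpath(target))
    return sorted(seen)
```

Calling collect_doc_files("index.html") from the top of the unpacked source would then give the list of files to install as documentation. Links that point at missing files are silently dropped, which may or may not be what a packaging script wants.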