RFC1738 - Uniform Resource Locators (URL) http://www.faqs.org/rfcs/rfc1738.html
RFC1808 - Relative Uniform Resource Locators http://www.faqs.org/rfcs/rfc1808.html

> Hi, I'm using httpcli to save a webpage html doc and I extract all of
> its image locations to a text file by saving the '<IMG SRC=' tags.
> Afterward I want to download all of the images, but how can I determine
> the TRUE location of the images? For example, say the image tag is:
> '<IMG SRC='test.com/photo.jpg'' - for all I know, "test.com" could just
> be a directory on the server or it could be the website. Another
> example, say the image tag is: '<IMG SRC='/photo.jpg'' - so the image is
> in the root directory of the website, but who knows what the root
> directory is? It may simply be 'test.com', or if the html doc is located
> in a subdirectory, it may be something like 'test.com/users/me'.
>
> So, what is the appropriate way to determine the actual true location of
> these images from the 'IMG' tags?
>
> Much thanks in advance.
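
In short, the RFCs above say that a relative reference is resolved against the URL the document itself was fetched from (or against a <BASE HREF> tag, if the page has one). Here is a rough sketch of that rule in Python rather than Delphi, using the standard library's urllib.parse.urljoin. The page URL and the helper name resolve_image_urls are made up for illustration, and urljoin follows the newer RFC 3986 rules, which give the same answers as RFC 1808 for cases like these.

```python
from urllib.parse import urljoin


def resolve_image_urls(page_url, srcs):
    """Resolve each SRC value against the URL the page was fetched from.

    page_url is an assumption: it must be the full URL you gave to the
    HTTP client (or the page's <BASE HREF> value, if it has one).
    """
    return [urljoin(page_url, src) for src in srcs]


# Hypothetical page location, chosen to match the examples in the question.
page_url = "http://test.com/users/me/index.html"

srcs = [
    "test.com/photo.jpg",           # no scheme: a path relative to the page's directory
    "/photo.jpg",                   # leading slash: relative to the server root
    "../img/a.png",                 # dot segments: go up one directory first
    "http://other.example/b.gif",   # already absolute: used unchanged
]

for src, absolute in zip(srcs, resolve_image_urls(page_url, srcs)):
    print(f"{src!r:32} -> {absolute}")
```

So 'test.com/photo.jpg' without a scheme is just a directory named "test.com" below the page's own directory, and '/photo.jpg' always means the root of the host the page came from, regardless of which subdirectory the page is in.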