Here is a pull request with the changes needed for Hadoop to access archive.org's S3 service:
https://github.com/apache/hadoop/pull/79

I now have access to archive.org working in my test case, which is here:
https://github.com/h4ck3rm1k3/hadoop-archive-org-bucket-fs

new URI("s3://bucketname").getHost() is returning null, and I wonder how this ever worked. I switched it to use the authority instead, because it seems that Java's URI class is treating it as a "hierarchical URI":

[scheme:][//authority][path][?query][#fragment]

See http://cr.openjdk.java.net/~mr/jigsaw/spec/api/java/net/URI.html

I am new to this project, so please excuse my mistakes; maybe I missed something obvious about how to do this.

Thanks,
mike

--
James Michael DuPont
Kansas Linux Fest http://kansaslinuxfest.us
Free/Libre Open Source and Open Knowledge Association of Kansas http://openkansas.us
Member of Free Libre Open Source Software Kosova http://www.flossk.org
Saving Wikipedia(tm) articles from deletion http://SpeedyDeletion.wikia.com
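
For reference, a minimal Java sketch of the getHost() versus getAuthority() behaviour described above. It is not the code from the pull request. The class name UriAuthorityDemo, the helper bucketOf, and the bucket name "bucket_name" are hypothetical and chosen only for illustration. One possible explanation for the null, which is an assumption and not something confirmed in this thread, is a bucket name that is not a valid hostname (for example, one containing an underscore): java.net.URI then cannot parse the authority as server-based, so getHost() returns null while getAuthority() still returns the raw text.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class UriAuthorityDemo {
    // Hypothetical helper: prefer the parsed host, and fall back to the
    // raw authority when java.net.URI could not parse a server-based host.
    static String bucketOf(URI uri) {
        String host = uri.getHost();
        return host != null ? host : uri.getAuthority();
    }

    public static void main(String[] args) throws URISyntaxException {
        // A name that is a valid hostname parses as a server-based authority.
        URI plain = new URI("s3://bucketname");
        // An underscore is not legal in a hostname, so the authority is
        // treated as registry-based and the host component is undefined.
        URI underscored = new URI("s3://bucket_name");

        System.out.println("host:      " + plain.getHost());
        System.out.println("host:      " + underscored.getHost());
        System.out.println("authority: " + underscored.getAuthority());
        System.out.println("bucket:    " + bucketOf(underscored));
    }
}
```

Falling back to the authority keeps the existing behaviour for names that do parse as hosts. Note that an authority can also carry user-info or a port, which a real fix would need to consider.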