On May 27, 12:29 pm, powah <wong_po...@yahoo.ca> wrote: > I want to download all mib files from the web > page:http://www.juniper.net/techpubs/software/junos/junos94/swconfig-net-m... > > All mib filenames are of this format:www.juniper.net/techpubs... .txt > > I write this program but has the following error. > Please help. > Thanks. > > [code] > #!/usr/bin/env python > import urllib2,os,urlparse > url="http://www.juniper.net/techpubs/software/junos/junos94/swconfig- > net-mgmt/juniper-specific-mibs-junos-nm.html#jN18E19" > page=urllib2.urlopen(url) > f=0 > links=[] > data=page.read().split("\n") > for item in data: > if "www.juniper.net/techpubs" in item: > httpind=item.index("www.juniper.net/techpubs") > item=item[httpind:] > #print "item " + item > ind=item.index("<") > links.append(item[:ind]) #grab all links > # download all links > for link in links: > print "link " + link > filename=link.split("/")[-1] > print "downloading ... " + filename > u=urllib2.urlopen(link) > p=u.read() > open(filename,"w").write(p) > [/code] > > $ ~/python/downloadjuniper.py > linkwww.juniper.net/techpubs/software/junos/junos94/swconfig-net-mgmt/mib... > downloading ... mib-jnx-user-aaa.txt > Traceback (most recent call last): > File "/home/powah/python/downloadjuniper.py", line 20, in ? > u=urllib2.urlopen(link) > File "/usr/lib/python2.4/urllib2.py", line 130, in urlopen > return _opener.open(url, data) > File "/usr/lib/python2.4/urllib2.py", line 350, in open > protocol = req.get_type() > File "/usr/lib/python2.4/urllib2.py", line 233, in get_type > raise ValueError, "unknown url type: %s" % self.__original > ValueError: unknown url > type:www.juniper.net/techpubs/software/junos/junos94/swconfig-net-mgmt/mib... > > $ python > Python 2.4.4 (#1, Oct 23 2006, 13:58:00) > [GCC 4.1.1 20061011 (Red Hat 4.1.1-30)] on linux2 > Type "help", "copyright", "credits" or "license" for more information. > > > > My computer is FC6 linux.
I fixed one error, now if the filename is misspelled, how to ignore the error and continue? [code] #!/usr/bin/env python import urllib2,os,urlparse url="http://www.juniper.net/techpubs/software/junos/junos94/swconfig- net-mgmt/juniper-specific-mibs-junos-nm.html#jN18E19" page=urllib2.urlopen(url) f=0 links=[] data=page.read().split("\n") for item in data: if "www.juniper.net/techpubs" in item: httpind=item.index("www.juniper.net/techpubs") item=item[httpind:] #print "item " + item ind=item.index(".txt") + 4 links.append(item[:ind]) #grab all links # download all links for link in links: filename=link.split("/")[-1] link = "http://" + link print "link " + link print "downloading ... " + filename u=urllib2.urlopen(link) p=u.read() open(filename,"w").write(p) [/code] $ ~/python/downloadjuniper_onepage.py link http://www.juniper.net/techpubs/software/junos/junos94/swconfig-net-mgmt/mib-jnx-virtual-chassis.txt downloading ... mib-jnx-virtual-chassis.txt Traceback (most recent call last): File "/home/powah/python/downloadjuniper_onepage.py", line 7, in ? u=urllib2.urlopen(link) File "/usr/lib/python2.4/urllib2.py", line 130, in urlopen return _opener.open(url, data) File "/usr/lib/python2.4/urllib2.py", line 364, in open response = meth(req, response) File "/usr/lib/python2.4/urllib2.py", line 471, in http_response response = self.parent.error( File "/usr/lib/python2.4/urllib2.py", line 402, in error return self._call_chain(*args) File "/usr/lib/python2.4/urllib2.py", line 337, in _call_chain result = func(*args) File "/usr/lib/python2.4/urllib2.py", line 480, in http_error_default raise HTTPError(req.get_full_url(), code, msg, hdrs, fp) urllib2.HTTPError: HTTP Error 404: Not Found -- http://mail.python.org/mailman/listinfo/python-list