subject:"Re\: Python Web Scrapping \: Within href readonly those value that have href in it"

Re: Python Web Scrapping : Within href readonly those value that have href in it

2017-01-16 Thread Jesse Alama

To complement what Peter wrote: I'd approach this problem using XPath. XPath is a query language for XML/HTML documents; it's a great tool to have in your web scraping toolbox (among other tasks). With Python's excellent lxml library you can do some XPath processing. Here's how I might tackle this

Re: Python Web Scrapping : Within href readonly those value that have href in it

2017-01-14 Thread Peter Otten

shahs...@gmail.com wrote: > I am trying to scrape a webpage just for learning. In that webpage there > are multiple "a" tags. consider the below code > > Something > > Something These are probaly all forward slashes. > Now i want to read only those href in which there is http. My Current co