To complement what Peter wrote: I'd approach this problem using
XPath. XPath is a query language for XML/HTML documents; it's a great
tool to have in your web scraping toolbox (among other tasks). With
Python's excellent lxml library you can do some XPath processing. Here's
how I might tackle this
shahs...@gmail.com wrote:
> I am trying to scrape a webpage just for learning. In that webpage there
> are multiple "a" tags. consider the below code
>
> Something
>
> Something
These are probaly all forward slashes.
> Now i want to read only those href in which there is http. My Current co