On 24 Nov, 05:42, "Just Another Victim of the Ambient Morality" <[EMAIL PROTECTED]> wrote:
> I did find some people who complained about this and rolled their own
> solution but I refuse to believe that Python doesn't have a built-in
> solution to what must be a very common problem.

<snip>

Replace "python" with "c++" and would that seem a reasonable belief?
(That said, I'm a PyN00b.)

Anyway, for all my HTML processing needs, my first port of call has been
BeautifulSoup, e.g.

    from BeautifulSoup import BeautifulSoup

    soup = BeautifulSoup(html, convertEntities="html")
    print soup.findAll(text=True)

That should be in the ballpark of what you want. See
http://www.crummy.com/software/BeautifulSoup/documentation.html for docs.
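
For comparison, here is a rough sketch of the same text-only extraction using nothing but the standard library. It assumes Python 3 and its html.parser module, which is not what the snippet above targets. The names TextExtractor and extract_text are made up for illustration, and this is not a drop-in replacement for BeautifulSoup: it does no clean-up of badly broken markup, and it returns the contents of script and style elements as text too.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects the text nodes of an HTML document."""

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; before
        # the text reaches handle_data().
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        # Called for each run of text found between tags.
        self.chunks.append(data)


def extract_text(html):
    """Return the text fragments of `html` as a list of strings."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return parser.chunks


if __name__ == "__main__":
    print("".join(extract_text("<p>Fish &amp; <b>chips</b></p>")))
```

The list it returns plays roughly the same role as the result of findAll(text=True); joining it gives the document's text with entities decoded.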