Re: Parsing html with Beautifulsoup

2009-12-14 Thread Gabriel Genellina
On Mon, 14 Dec 2009 03:58:34 -0300, Johann Spies wrote: On Sun, Dec 13, 2009 at 07:58:55AM -0300, Gabriel Genellina wrote: cell.findAll(text=True) returns a list of all text nodes inside a cell; I preprocess all \n and &nbsp; in each text node, and join them all. lines is a list of lists (each
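
The approach described above (collect every text node in a cell, replace \n and &nbsp;, join the pieces, and build a list of lists) can be sketched roughly as follows. This is not the code posted in the thread, which is cut off here. It swaps in the standard-library html.parser module in place of BeautifulSoup's cell.findAll(text=True) so that it runs without extra packages; the names clean_text, TableText and table_to_lines are made up for illustration, and joining the pieces without a separator is an assumption.

```python
from html.parser import HTMLParser


def clean_text(nodes):
    """Join a cell's text nodes after replacing newlines and non-breaking spaces."""
    pieces = [n.replace("\n", " ").replace("\xa0", " ") for n in nodes]
    # Assumption: pieces are joined without a separator, then whitespace is collapsed.
    return " ".join("".join(pieces).split())


class TableText(HTMLParser):
    """Collects the text of every <td>/<th> into a list of rows (a list of lists)."""

    def __init__(self):
        # convert_charrefs=True turns &nbsp; into "\xa0" before handle_data sees it.
        super().__init__(convert_charrefs=True)
        self.lines = []      # one inner list per <tr>
        self._nodes = None   # text nodes of the cell being read, or None outside a cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.lines.append([])
        elif tag in ("td", "th"):
            self._nodes = []

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._nodes is not None:
            if not self.lines:          # cell that appears without an enclosing <tr>
                self.lines.append([])
            self.lines[-1].append(clean_text(self._nodes))
            self._nodes = None

    def handle_data(self, data):
        if self._nodes is not None:
            self._nodes.append(data)


def table_to_lines(html):
    """Return the table cells of an HTML string as a list of lists of strings."""
    parser = TableText()
    parser.feed(html)
    parser.close()
    return parser.lines
```

Nested markup inside a cell, such as <b>c</b> followed by plain text, arrives as separate text nodes, which is why the cleaning step works on a list rather than on one string.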

Re: Parsing html with Beautifulsoup

2009-12-13 Thread Johann Spies
On Sun, Dec 13, 2009 at 07:58:55AM -0300, Gabriel Genellina wrote: > this code should serve as a starting point: Thank you very much! > cell.findAll(text=True) returns a list of all text nodes inside a > cell; I preprocess all \n and &nbsp; in each text node, and > join them all. lines is a list of

Re: Parsing html with Beautifulsoup

2009-12-13 Thread Gabriel Genellina
On Fri, 11 Dec 2009 04:04:38 -0300, Johann Spies wrote: Gabriel Genellina wrote: On Thu, 10 Dec 2009 06:15:19 -0300, Johann Spies wrote: How do I get Beautifulsoup to render (taking the above line as example) sunentint for  sunetint and still provide the text-parts in the

Re: Parsing html with Beautifulsoup

2009-12-10 Thread Johann Spies
Gabriel Genellina wrote: On Thu, 10 Dec 2009 06:15:19 -0300, Johann Spies wrote: How do I get Beautifulsoup to render (taking the above line as example) sunentint for  sunetint and still provide the text-parts in the 's with plain text? Hard to tell if we don't see what's inside

Re: Parsing html with Beautifulsoup

2009-12-10 Thread Gabriel Genellina
On Thu, 10 Dec 2009 06:15:19 -0300, Johann Spies wrote: How do I get Beautifulsoup to render (taking the above line as example) sunentint for  sunetint and still provide the text-parts in the 's with plain text? Hard to tell if we don't see what's inside those 's - please provide at

Parsing html with Beautifulsoup

2009-12-10 Thread Johann Spies
I am trying to get csv-output from a html-file. With this code I had a little success:

    from BeautifulSoup import BeautifulSoup
    from string import replace, join
    import re
    f = open("configuration.html","r")
    g = open("configuration.csv",'w')
    soup = BeautifulSoup(f)
    t = soup
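
For the CSV side of the task, one possible sketch is to hand the rows to the standard-library csv module rather than joining fields by hand. This is only an illustration: it assumes the table has already been reduced to a list of rows, each a list of strings, and the function name lines_to_csv and the sample rows are hypothetical, not taken from the posted script.

```python
import csv
import io


def lines_to_csv(lines):
    """Serialise a list of rows (each a list of cell strings) as CSV text."""
    buffer = io.StringIO()
    # csv.writer quotes fields that contain commas, quotes or newlines.
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerows(lines)
    return buffer.getvalue()


# Hypothetical sample rows; a real run would take them from the parsed HTML table.
sample = [["name", "value"], ["timeout", "30, seconds"]]
print(lines_to_csv(sample), end="")
```

Letting csv.writer handle quoting avoids broken output when a cell's text itself contains a comma.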