Re: Parsing markup.

2010-11-29 Thread Alan Meyer
On 11/29/2010 11:20 AM, Joe Goldthwaite wrote: Hi MRAB, I was trying to avoid regex because my poor old brain has trouble with it. I have to admit though, that line is slick! I'll have to go through my regex documentation to try and figure out what it actually means. Personally, I'd be hesita
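
For readers puzzling over the regex approach mentioned here, the following is a minimal sketch of the general idea, not MRAB's actual line (which is not reproduced in this listing). It assumes simple attribute-free tags such as <p>, <b> and <i>; the sample string and the names TAG and tokenize are made up for illustration.

```python
import re

# Hypothetical sample; the tag names and text are assumptions for illustration.
sample = "<p>This is a paragraph with <b>bold</b> and <i>italic</i> elements in it</p>"

# Matches simple opening or closing tags without attributes, e.g. <b> or </b>.
# The capturing group makes re.split keep the tags in its result.
TAG = re.compile(r"(</?\w+>)")

def tokenize(markup):
    """Split markup into a flat list of tags and text runs, dropping empty strings."""
    return [piece for piece in TAG.split(markup) if piece]

tokens = tokenize(sample)
for token in tokens:
    print(repr(token))
```

The capturing group is the only trick: without the parentheses, re.split would throw the tags away and return just the text. Tags with attributes or malformed markup would need a real parser rather than this pattern.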

Re: Parsing markup.

2010-11-29 Thread Stefan Behnel
Jon Clements, 26.11.2010 13:58: On Nov 26, 4:03 am, MRAB wrote: On 26/11/2010 03:28, Joe Goldthwaite wrote: > I’m attempting to parse some basic tagged markup. The output of the > TinyMCE editor returns a string that looks something like this; > > This is a paragraph with bold and it

RE: Parsing markup.

2010-11-29 Thread Joe Goldthwaite
s+joe=goldthwaites@python.org [mailto:python-list-bounces+joe=goldthwaites@python.org] On Behalf Of MRAB Sent: Thursday, November 25, 2010 9:03 PM To: python-list@python.org Subject: Re: Parsing markup. On 26/11/2010 03:28, Joe Goldthwaite wrote: > I'm attempting to parse some basic tagged

Re: Parsing markup.

2010-11-26 Thread Jon Clements
On Nov 26, 4:03 am, MRAB wrote: > On 26/11/2010 03:28, Joe Goldthwaite wrote: >  > I’m attempting to parse some basic tagged markup.  The output of the >  > TinyMCE editor returns a string that looks something like this; >  > >  > This is a paragraph with bold and italic elements in >  > itIt can

Re: Parsing markup.

2010-11-25 Thread Javier Collado
Hello, 2010/11/26 Joe Goldthwaite : > I’m attempting to parse some basic tagged markup. > >  Elementree and lxml seem to want a full formatted > page, not a small segment like this one. BeautifulSoup (http://www.crummy.com/software/BeautifulSoup/) could help in the parsing: >>> from BeautifulSou
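
As an alternative that needs no third-party package, here is a rough sketch using the standard library's html.parser, which accepts a bare fragment rather than a full page. It is written for Python 3 (the module was named HTMLParser in the Python 2 of this thread's era). The class name RunCollector, the helper parse_fragment, and the sample markup are assumptions for illustration, not code from the thread.

```python
from html.parser import HTMLParser

class RunCollector(HTMLParser):
    """Collect (text, enclosing_tags) pairs from a markup fragment."""

    def __init__(self):
        super().__init__()
        self.open_tags = []   # tags currently open, outermost first
        self.runs = []        # (text, tuple_of_open_tags) in document order

    def handle_starttag(self, tag, attrs):
        self.open_tags.append(tag)

    def handle_endtag(self, tag):
        # Close the most recently opened matching tag; ignore stray closers.
        if tag in self.open_tags:
            idx = len(self.open_tags) - 1 - self.open_tags[::-1].index(tag)
            del self.open_tags[idx]

    def handle_data(self, data):
        self.runs.append((data, tuple(self.open_tags)))

def parse_fragment(markup):
    """Return the text runs of a fragment with the tags that enclose each run."""
    parser = RunCollector()
    parser.feed(markup)
    parser.close()
    return parser.runs

# Hypothetical fragment, assumed to resemble the editor output being discussed.
for text, tags in parse_fragment("<p>A paragraph with <b>bold</b> and <i>italic</i> text</p>"):
    print(tags, repr(text))
```

Each run carries the tags open at that point, so a caller can decide how to render bold or italic text without tracking state itself.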

Re: Parsing markup.

2010-11-25 Thread MRAB
On 26/11/2010 03:28, Joe Goldthwaite wrote: > I’m attempting to parse some basic tagged markup. The output of the > TinyMCE editor returns a string that looks something like this; > > This is a paragraph with bold and italic elements in > itIt can be made up of multiple lines separated by pagagra