Re: ElementTree.fromstring(unicode_html)

2008-01-27 Thread Fredrik Lundh

globophobe wrote:
> In [1]: unicode_html = u'\u3055\u3080\u3044\uff0f\r\n\u3064\u3081\u305f
> \u3044\r\n'
>
> I need to turn this into an elementtree, but some of the data is
> japanese whereas the rest is html. This string contains a .

where? is an element, not a character. "\r" and "\n" are
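The point in the reply above, that "\r" and "\n" are characters rather than elements, can be checked with a minimal sketch like the one below. It assumes Python 3.3 or later and the standard-library xml.etree.ElementTree module. The <div> wrapper is a hypothetical addition, not something from the thread, and the string literal is rejoined on the assumption that the break in the quoted post is only mail quote wrapping.

```python
import xml.etree.ElementTree as ET

# String from the quoted post, rejoined across the quote wrap (assumption).
unicode_html = u'\u3055\u3080\u3044\uff0f\r\n\u3064\u3081\u305f\u3044\r\n'

# Hypothetical wrapper element: a bare text fragment has no root element,
# so fromstring() cannot parse it on its own.
root = ET.fromstring(u'<div>' + unicode_html + u'</div>')

# The line breaks end up inside the element's text, not as child elements.
print(len(root))        # number of child elements under the wrapper
print(repr(root.text))  # character data, including the line breaks
```

Since the parsed tree has no child elements, anything that should become markup has to be present as markup in the input string before it is parsed.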

Re: ElementTree.fromstring(unicode_html)

2008-01-25 Thread John Machin

On Jan 26, 1:11 pm, globophobe <[EMAIL PROTECTED]> wrote:
> This is likely an easy problem; however, I couldn't think of
> appropriate keywords for google:
>
> Basically, I have some raw data that needs to be preprocessed before
> it is saved to the database e.g.
>
> In [1]: unicode_html = u'\u3055