Re: SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Lawrence D'Oliveiro
In message <[EMAIL PROTECTED]>, Carsten Haese wrote: > If your input string contains the byte 0xe9 where your accented e is, > the file is actually latin-1 encoded. If it contains the byte sequence > 0xc3,0xa9 it is UTF-8 encoded. It is dismaying how often I come across Web pages that claim to be

Re: SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Carsten Haese
On Thu, 2007-08-30 at 15:20 +0200, Pablo Rey wrote: > About the possibility that you mention to recoding the input, could you > let me know how to do it?. I am sorry I am starting with Python and I > don't know how to do it. While I answered this question in my previous reply, I wanted to

Re: SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Carsten Haese
On Thu, 2007-08-30 at 15:20 +0200, Pablo Rey wrote: > Hi Stefan, > > The xml has specified an encoding ( ?>). It's possible that the encoding specification is incorrect: >>> u = u"\N{LATIN SMALL LETTER E WITH ACUTE}" >>> print repr(u.encode("latin-1")) '\xe9' >>> print repr(u.encode(

Re: SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Marc 'BlackJack' Rintsch
On Thu, 30 Aug 2007 15:31:58 +0200, Pablo Rey wrote: > On 30/08/2007 14:35, Marc 'BlackJack' Rintsch wrote: > >> Maybe you can download that XML file and use `xmllint` to check if it >> is well formed XML!? > > This is the output of the xmllint command: > > [EMAIL PROTECTED] voms2users]$ x

Re: SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Pablo Rey
Hi Stefan, The xml has specified an encoding (). About the possibility that you mention to recoding the input, could you let me know how to do it?. I am sorry I am starting with Python and I don't know how to do it. Thanks by your help. Pablo

Re: SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Stefan Behnel
Pablo Rey wrote: > I am getting the following error with a XML page: > >> File "/home/prey/RAL-CESGA/bin/voms2users/voms2users.py", line 69, >> in getItems >> d = minidom.parseString(xml.read()) >> File "/usr/lib/python2.2/site-packages/_xmlplus/dom/minidom.py", >> line 967, in parseSt

Re: SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Marc 'BlackJack' Rintsch
On Thu, 30 Aug 2007 13:46:47 +0200, Pablo Rey wrote: > The page is > https://lcg-voms.cern.ch:8443/voms/cms/services/VOMSCompatibility?method=getGridmapUsers > > and the line with the invalid character is (the invalid character is the > final é of Université): The URL doesn't work for m

SAXParseException: not well-formed (invalid token)

2007-08-30 Thread Pablo Rey
Dear Colleagues, I am getting the following error with a XML page: > File "/home/prey/RAL-CESGA/bin/voms2users/voms2users.py", line 69, in > getItems > d = minidom.parseString(xml.read()) > File "/usr/lib/python2.2/site-packages/_xmlplus/dom/minidom.py", line 967, > in p

Re: Please help!! SAXParseException: not well-formed (invalid token)

2007-03-27 Thread Diez B. Roggisch
[EMAIL PROTECTED] schrieb: > I checked the file format (of the file containing the n-tilde - ñ) and > it is indeed UTF-8! I'm baffled! Any ideas? Without you showing us your actual code and data - no. Because it works for me and a lot of other people. Diez -- http://mail.python.org/mailman/lis

Re: Please help!! SAXParseException: not well-formed (invalid token)

2007-03-27 Thread jvictor118
I checked the file format (of the file containing the n-tilde - ñ) and it is indeed UTF-8! I'm baffled! Any ideas? Thanks, Jason On Mar 27, 11:16 am, "Diez B. Roggisch" <[EMAIL PROTECTED]> wrote: > [EMAIL PROTECTED] wrote: > > I've been using the xml.sax.handler module to do event-driven parsing

Re: Please help!! SAXParseException: not well-formed (invalid token)

2007-03-27 Thread Diez B. Roggisch
[EMAIL PROTECTED] wrote: > I've been using the xml.sax.handler module to do event-driven parsing > of XML files in this python application I'm working on. However, I > keep having really pesky invalid token exceptions. Initially, I was > only getting them on control characters, and a little "sed -

Re: Please help!! SAXParseException: not well-formed (invalid token)

2007-03-27 Thread kyosohma
On Mar 27, 9:59 am, [EMAIL PROTECTED] wrote: > I've been using the xml.sax.handler module to do event-driven parsing > of XML files in this python application I'm working on. However, I > keep having really pesky invalid token exceptions. Initially, I was > only getting them on control characters,

Please help!! SAXParseException: not well-formed (invalid token)

2007-03-27 Thread jvictor118
I've been using the xml.sax.handler module to do event-driven parsing of XML files in this python application I'm working on. However, I keep having really pesky invalid token exceptions. Initially, I was only getting them on control characters, and a little "sed -e 's/ [^[:print:]]/ /g' $1;" took