Tim Roberts wrote:
"bruce" <[EMAIL PROTECTED]> wrote:
it's the beautifulsoup() that's taking the "&E" and giving the "&E;"...
Right, as it should. "A&E" is not valid HTML, and beautifulsoup expects
valid HTML.
This can be difficult to fix in the general case, because your page might
already
"bruce" <[EMAIL PROTECTED]> wrote:
>
>it's the beautifulsoup() that's taking the "&E" and giving the "&E;"...
Right, as it should. "A&E" is not valid HTML, and beautifulsoup expects
valid HTML.
This can be difficult to fix in the general case, because your page might
already contain "&". If it
aha...
it's the beautifulsoup() that's taking the "&E" and giving the "&E;"...
-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] Behalf
Of Fredrik Lundh
Sent: Sunday, August 31, 2008 1:10 PM
To: python-list@python.org
Subject:
riginal Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] Behalf
Of Fredrik Lundh
Sent: Sunday, August 31, 2008 1:10 PM
To: python-list@python.org
Subject: Re: parsing "&A" in a string..
bruce wrote:
> a pretty simple question, i'm guessing.
>
> i have a text/h
bruce wrote:
a pretty simple question, i'm guessing.
i have a text/html string that looks like:
(A&E)
the issue i have is that when i parse it using xpath/node/toString,
i get the following
...(A&E;).
that's because your parser is interpreting the &E part as an entity
reference