On Fri, Jun 4, 2010 at 13:30, Roman Makurin <dro...@gmail.com> wrote:
> I used mostly the same things, but without uri_(un)escape it doesnt work.
> If i clearly understant, in order to parse feed i need apply escape
> methods to broken elements, am i right ?
snip

How you fix each section that is broken is dependent on how that
section is broken.  One thing that may be tripping you up is that some
XML parsers can't handle UTF-8.  Can you tell us what error messages
you are getting after the substitution I suggested?

snip
> i there any module with magic function like fixBrokenXML() ?
snip

How would such a function work?  How would it be able to tell bad XML
like "&amp" from a URL that hasen't been properly escaped like
"http://foo.com?x=1&amp=4";?  In one case it would need to add a
semi-colon and in the other it would need to say
"http://foo.com?x=1&amp;amp=4";,
"http%3A%2F%2Ffoo.com%3Fx%3D1%26amp%3D4", or something similar.

-- 
Chas. Owens
wonkden.net
The most important skill a programmer can have is the ability to read.

-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/


Reply via email to