Re: Possible bug in XML::SAX causing new_from_xml() to croak

2006-05-19 Thread Edward Summers
On May 19, 2006, at 9:57 PM, Edward Summers wrote:
> $parser = XML::SAX::ParserFactory->new();
oops that should've been:
$parser = XML::SAX::ParserFactory->parser();
I also should have mentioned that you can tell XML::SAX::ParserFactory which underlying parser to use, but I fo

Re: Possible bug in XML::SAX causing new_from_xml() to croak

2006-05-19 Thread Edward Summers
On May 19, 2006, at 7:59 PM, Joshua Ferraro wrote:
> I've attached a small script that reproduces the same error we're getting in the new_from_xml() method. Try it out and see what it does for you.
Works ok for me, at least it doesn't crash :-)
> So ... Is there a workaround that we can use to fi

Possible bug in XML::SAX causing new_from_xml() to croak

2006-05-19 Thread Joshua Ferraro
Hi all, OK ... I think I've narrowed down the problem to a bug in XML::SAX when dealing with combining characters, but I'd like some feedback before I go and post this bug ... I'm new to SAX and XML parsing/encoding issues in general ... I've attached a small script that reproduces the same erro

Re: MARC Records, XML, and encoding

2006-05-19 Thread Joshua Ferraro
Hi Andy, Thanks for checking on that. I can confirm that the record has E2 in it and that when I replace E2 with a ? character, new_from_xml() doesn't croak. I was able to find the E2 character in the code tables provided by LOC (which I assume are what MARC::Charset uses). Here is what they've g

RE: MARC Records, XML, and encoding

2006-05-19 Thread Houghton,Andrew
> From: Joshua Ferraro [mailto:[EMAIL PROTECTED]
> Sent: 19 May, 2006 13:40
> To: Edward Summers
> Cc: perl4lib
> Subject: Re: MARC Records, XML, and encoding
>
> Hi all,
>
> Here is an OCLC record:
>
> http://liblime.com/public/oclc1.dat
>
> So ... any suggestions for tracking down this probl

Re: MARC Records, XML, and encoding

2006-05-19 Thread Joshua Ferraro
Hi all, Here is an OCLC record: http://liblime.com/public/oclc1.dat I feed it into the as_xml method and I get what appears to be valid XML: http://liblime.com/public/oclc1.xml When I take that xml and feed it to the new_from_xml method and print it to a file I get the error: Cannot decode st