> From: Joshua Ferraro [mailto:[EMAIL PROTECTED] 
> Sent: 19 May, 2006 13:40
> To: Edward Summers
> Cc: perl4lib
> Subject: Re: MARC Records, XML, and encoding
> 
> Hi all,
> 
> Here is an OCLC record:
> 
> http://liblime.com/public/oclc1.dat
> 
> So ... any suggestions for tracking down this problem? ... 
> and what about ideas for handling these records 'in the wild' 
> that have some encoding problems... what do other MARC libraries do?

I was curious about whether this record was bad in WorldCat and since
I have access to WorldCat, I looked at the record.  There appears to
be one diacritic in this record, a MARC-8 E2, combining acute, which
has "e" as its base character.  I exported the record from WorldCat
and it does in fact have an E2 in it.

However, the size of the record, above, and the one I exported from
OCLC Connexion are different.  Above, 1442 bytes vs. OCLC 1387.  The
005's are, above 20060516100102.0 vs. OCLC 20060519162028.0.  So it's
not surprising that the sizes are different.

When I use MarcView on both records it doesn't complain and looking
at both records side-by-side it appears that there are very minor
edits.  I suspect that the record was edited on OCLC, then exported,
where as I just exported the record without making any edits.

This doesn't solve your issue, but I don't think the issue is with
the actual content of the record.


Andy.

Andrew Houghton, OCLC Online Computer Library Center, Inc.
http://www.oclc.org/about/
http://www.oclc.org/research/staff/houghton.htm

Reply via email to