RE: MARC::Charset question

2007-05-18 Thread Doran, Michael D
Oops, this got mangled somehow...
> U+0044 LATIN CAPITAL LETTER D
> U+006F LATIN SMALL LETTER O
> U+006E LATIN SMALL LETTER N
> U+0074 LATIN SMALL LETTER T
> U+FE20 LIGATURE, FIRST HALF / COMBINING LIGATURE LEFT HALF
> U+0073 LATIN SMALL LETTER S
> U+FE21 LIGATURE, SECOND HALF / COMBINING L

Re: MARC::Charset question

2007-05-18 Thread Ed Summers
Michael, would you be willing to work with me to come up with an automated test case to see if this is a problem w/ MARC::Charset? //Ed

RE: MARC::Charset question

2007-05-18 Thread Doran, Michael D
Hi Michael,
> An example is the author (personal name) of the book that can
> be found at http://catalog.loc.gov/ by searching for ISBN
> 5040039875 (I'm guessing the fact that the website appears to
> be displaying a corrupted name may be part of the problem here).
The Library of Congress cat

MARC::Charset question

2007-05-18 Thread moconnor59
Hi, I'm using marc8_to_utf8() on Library of Congress data. I'm finding that I get occasional null characters inserted in the output text, and I'm wondering what this means. An example is the author (personal name) of the book that can be found at http://catalog.loc.gov/ by searching for ISBN 5040
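One way to narrow down a report like this is to dump the converted string code point by code point and note where any U+0000 characters fall. The following is only a minimal sketch in Python, not MARC::Charset itself; the helper names `describe_codepoints` and `find_nulls` are hypothetical, and the sample string is made up from the code points quoted elsewhere in this thread, with a null inserted purely for illustration.

```python
import unicodedata


def describe_codepoints(text):
    """Return one 'U+XXXX NAME' line per character in text."""
    lines = []
    for ch in text:
        # Control characters such as U+0000 have no Unicode name,
        # so fall back to a placeholder instead of raising ValueError.
        name = unicodedata.name(ch, "<no name: control or unassigned>")
        lines.append(f"U+{ord(ch):04X} {name}")
    return lines


def find_nulls(text):
    """Return the indexes of every U+0000 character in text."""
    return [i for i, ch in enumerate(text) if ch == "\x00"]


# Hypothetical sample: not real Library of Congress data.
sample = "Don\x00t\ufe20s\ufe21"

if __name__ == "__main__":
    for line in describe_codepoints(sample):
        print(line)
    print("null characters at indexes:", find_nulls(sample))
```

Running the same kind of dump on the actual converted field would show whether the nulls sit next to the combining ligature halves or somewhere else, which could help when building a test case.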