Ed Summers wrote:
On Tue, Nov 23, 2004 at 04:10:05PM -0600, John Hammer wrote:
I have a character problem that I hope someone can help me with. In
a MARC record I am modifying using MARC::Record, one of the names
contains letters with diacritics. Looking at the name with a hex editor,
it gives, with hex values in curly brackets,"Bis{e5}a{f2}t{e5}i,
Mu{f2}hammad." After running through MARC::Record, the name now appears
as "Bis{ef bf bd}a{ef bf bd}t{ef bf bd}i, Mu{ef bf bd}hammad."
That's pretty odd. Any chance you could send me the MARC record? At this
time MARC::Record does not play nicely with Unicode (UTF8).
http://rt.cpan.org/NoAuth/Bug.html?id=3707
It is possible they are MARC-8 characters rather than utf-8. In MARC-8
E5 is "macron" and F2 is "dot below." Is MARC::Record trying to treat
than as Unicode when in fact they are MARC-8?
Ashley.
--
Ashley Sanders [EMAIL PROTECTED]
Copac http://copac.ac.uk -- A MIMAS service funded by JISC