Hi all,
I use the $cs->to_utf8 conversion from MARC::Charset to display MARC
Authority records in a browser, and the diacritics display properly there.
But they don't display properly via SDTOUT in my terminal window (I get two
characters instead of one -- one with the letter and one with the ac
> From: Christopher Morgan [mailto:[EMAIL PROTECTED]
> Sent: 01 July, 2004 10:50
> Subject: Displaying diacritics in a terminal vs. a browser
>
> I use the $cs->to_utf8 conversion from MARC::Charset to
> display MARC Authority records in a browser, and the
> diacritics display properly there.
>
Andy,
Many thanks. I'd be interested in looking at your scripts if you do post
them!
-- Chris
-Original Message-
From: Houghton,Andrew [mailto:[EMAIL PROTECTED]
Sent: Thursday, July 01, 2004 10:23 AM
To: [EMAIL PROTECTED]
Subject: RE: Displaying diacritics in a terminal vs. a browser
Unless I'm very much mistaken, Chris's code is outputting UTF-8 to
the terminal, not MARC-8.
The key is to find a terminal program that correctly displays UTF-8.
I doubt you'll have any trouble finding one -- for example, there
are at least two for Mac OS X alone (Terminal.app and iTerm).
Depending
> From: Paul Hoffman [mailto:[EMAIL PROTECTED]
> Sent: 01 July, 2004 11:57
> Subject: Re: Displaying diacritics in a terminal vs. a browser
>
> Unless I'm very much mistaken, Chris's code is outputting
> UTF-8 to the terminal, not MARC-8.
> >> From: Christopher Morgan [mailto:[EMAIL PROTECTED]
On Thu, Jul 01, 2004 at 11:22:42AM -0400, Houghton,Andrew wrote:
> I'm not sure what MARC::Charset does internally, but MARC-8
> defines the diacritic separate from the base character. So
> even using binmode(STDOUT,":utf8") will produce two characters,
> one for the base character followed by t
Jane,
Thanks very much for the information about Unicode and MARC-8. I still have a lot to
learn about the two formats! Since my MARC data is being manipulated primarily in a
browser via a cgi script, I'll forego writing a converter for the terminal display for
now, but I eventually plan to d
> A MARC-8 sequence places a combining diacritical mark BEFORE the letter
> it's supposed to combine. Whereas Unicode syntax is to put it AFTER the
> letter it's supposed to combine with.
>
> Hence for example the letter: ZÌ
> is produced by the MARC-8 Sequence:
> 75 5A (macron below + "Z")