Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Christopher Morgan
Hi all, I use the $cs->to_utf8 conversion from MARC::Charset to display MARC Authority records in a browser, and the diacritics display properly there. But they don't display properly via STDOUT in my terminal window (I get two characters instead of one -- one with the letter and one with the ac

RE: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Houghton,Andrew
> From: Christopher Morgan [mailto:[EMAIL PROTECTED] > Sent: 01 July, 2004 10:50 > Subject: Displaying diacritics in a terminal vs. a browser > > I use the $cs->to_utf8 conversion from MARC::Charset to > display MARC Authority records in a browser, and the > diacritics display properly there. >

RE: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Christopher Morgan
Andy, Many thanks. I'd be interested in looking at your scripts if you do post them! -- Chris -----Original Message----- From: Houghton,Andrew [mailto:[EMAIL PROTECTED] Sent: Thursday, July 01, 2004 10:23 AM To: [EMAIL PROTECTED] Subject: RE: Displaying diacritics in a terminal vs. a browser

Re: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Paul Hoffman
Unless I'm very much mistaken, Chris's code is outputting UTF-8 to the terminal, not MARC-8. The key is to find a terminal program that correctly displays UTF-8. I doubt you'll have any trouble finding one -- for example, there are at least two for Mac OS X alone (Terminal.app and iTerm). Depending

RE: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Houghton,Andrew
> From: Paul Hoffman [mailto:[EMAIL PROTECTED] > Sent: 01 July, 2004 11:57 > Subject: Re: Displaying diacritics in a terminal vs. a browser > > Unless I'm very much mistaken, Chris's code is outputting > UTF-8 to the terminal, not MARC-8. > >> From: Christopher Morgan [mailto:[EMAIL PROTECTED]

Re: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Ed Summers
On Thu, Jul 01, 2004 at 11:22:42AM -0400, Houghton,Andrew wrote: > I'm not sure what MARC::Charset does internally, but MARC-8 > defines the diacritic separate from the base character. So > even using binmode(STDOUT,":utf8") will produce two characters, > one for the base character followed by t
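
The "two characters" point quoted above can be illustrated with a minimal sketch. It is written in Python rather than the thread's Perl, and it is not what MARC::Charset does internally. It only shows that a base letter followed by a combining mark is two code points, and that Unicode NFC normalization composes them into one when a precomposed form exists. The variable names are illustrative.

```python
import unicodedata

# 'e' followed by U+0301 COMBINING ACUTE ACCENT: two code points, one visible letter
decomposed = "e\u0301"

# NFC composes base + mark into a single precomposed code point where one exists
composed = unicodedata.normalize("NFC", decomposed)

print(len(decomposed), len(composed))
```

A terminal that does not render combining sequences may still show the composed form correctly, which is one reason normalizing before printing can help.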

RE: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Christopher Morgan
Jane, Thanks very much for the information about Unicode and MARC-8. I still have a lot to learn about the two formats! Since my MARC data is being manipulated primarily in a browser via a cgi script, I'll forego writing a converter for the terminal display for now, but I eventually plan to d

Re: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Ed Summers
> A MARC-8 sequence places a combining diacritical mark BEFORE the letter > it's supposed to combine. Whereas Unicode syntax is to put it AFTER the > letter it's supposed to combine with. > > Hence for example the letter: Ẕ > is produced by the MARC-8 Sequence: > 75 5A (macron below + "Z")
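
The ordering difference described in the quoted message can be sketched as follows. This is a Python sketch rather than the thread's Perl, and it assumes the MARC-8 bytes have already been mapped to Unicode code points but are still in MARC-8 order (mark first, then letter). The function name marks_after_base is hypothetical and is not part of MARC::Charset.

```python
import unicodedata

def marks_after_base(text: str) -> str:
    """Move each run of combining marks so it follows the next base character."""
    out = []
    pending = []  # combining marks seen before their base letter
    for ch in text:
        if unicodedata.combining(ch):
            # Non-zero combining class: hold the mark until its base letter arrives
            pending.append(ch)
        else:
            out.append(ch)
            out.extend(pending)
            pending.clear()
    out.extend(pending)  # marks with no following base letter are kept at the end
    return "".join(out)

# Assumed input: U+0331 COMBINING MACRON BELOW placed before "Z", as in MARC-8 order
marc_order = "\u0331Z"
unicode_order = marks_after_base(marc_order)
```

After reordering, the string is in the mark-after-letter order that Unicode expects, and it can then be normalized or printed as UTF-8.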