RE: Ignoring Diacritics accessing Fixed Field Data

2005-01-11 Thread Jacobs, Jane W
That worked well! Thanks! JJ **Views expressed by the author do not necessarily represent those of the Queens Library.** Jane Jacobs Asst. Coord., Catalog Division Queens Borough Public Library 89-11 Merrick Blvd. Jamaica, NY 11432 tel.: (718) 990-0804 e-mail: [EMAIL PROTECTED] FAX. (718) 990-8

RE: Ignoring Diacritics accessing Fixed Field Data

2005-01-11 Thread Doran, Michael D
A bulletin from the "haste makes waste" department... > $ME =~ s/[\xE1-\xFE]//g; > $TITLE =~ s/[\xE1-\xFE]//g; Oops, that should be "E0" instead of "E1" as the first hex value in the substitutions: $ME =~ s/[\xE0-\xFE]//g;
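For readers following the thread, the corrected substitution can be wrapped in a small routine and applied before taking the first four characters. This is only a sketch under the thread's stated assumptions (MARC-8 data using the default basic and extended Latin sets, handled as raw bytes); the helper name strip_marc8_diacritics and the sample string are hypothetical and are not part of MARC::Record.

```perl
use strict;
use warnings;

# Hypothetical helper, not a MARC::Record method: delete every byte in the
# range 0xE0-0xFE, which the thread identifies as the MARC-8 combining
# diacritics. Assumes $text is a raw MARC-8 byte string, not UTF-8.
sub strip_marc8_diacritics {
    my ($text) = @_;
    return '' unless defined $text;
    $text =~ s/[\xE0-\xFE]//g;
    return $text;
}

# Made-up sample: a combining-range byte (0xF2) sits before the "t".
my $me = "Pa\xF2tel";

# Strip first, then take the first four remaining characters.
my $first_four = substr(strip_marc8_diacritics($me), 0, 4);
print "$first_four\n";
```

Stripping before calling substr matters: if the order is reversed, a diacritic byte counts toward the four characters and the key comes out one letter short.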

RE: Ignoring Diacritics accessing Fixed Field Data

2005-01-11 Thread Bryan Baldus
On Tuesday, January 11, 2005 2:13 PM, Michael Doran wrote: >Assuming that you are asking how to strip out the MARC-8 combining diacritic characters, try inserting the substitution >commands listed (as shown below) just prior to the substr commands: >> my $ME = $field->subfield('a'); >

RE: Ignoring Diacritics accessing Fixed Field Data

2005-01-11 Thread Doran, Michael D
Hi Jane, These answers assume that the data you are processing: 1) is encoded in the MARC-8 character set, and 2) consists of the MARC-8 default basic and extended Latin characters. > Dave,Ayod\2003 > Paòt,Kaâs\2002 > Baks,Dasa\2003 > ,Viâs\2002 > > Problem 1: As you can see, I don't really want

RE: Ignoring Diacritics accessing Fixed Field Data

2005-01-11 Thread Jacobs, Jane W
> Problem 1: As you can see, I don't really want the first four > characters, I want the first four SEARCHABLE characters. How can I > tell MARC Record to give me the first four characters, excluding > diacritics? What output would you have rather seen? Dave,Ayod\2003 Paot, Kaas\2002

Re: Ignoring Diacritics accessing Fixed Field Data

2005-01-11 Thread Ed Summers
Hi Jane: On Tue, Jan 11, 2005 at 01:29:55PM -0500, Jacobs, Jane W wrote: > My result was something like: > > Dave,Ayod\2003 > Paòt,Kaâs\2002 > Baks,Dasa\2003 > ,Viâs\2002 > > Problem 1: As you can see, I don't really want the first four characters, I > want the first four SEARCHABLE characters.

Ignoring Diacritics accessing Fixed Field Data

2005-01-11 Thread Jacobs, Jane W
Hi folks, I'm trying to write a routine to construct a text file of OCLC search keys from a group of existing records. What I want is something like: Brah,vasa/2003 That is, the first four letters of the 100 + comma + the first four letters of the 245 + slash + date. In principle I have this working with: open(
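The key format described in this message (four letters of the 100, a comma, four letters of the 245, a slash, the date) can be sketched as a plain function. This is not the poster's code, which is cut off above. The function name search_key and the sample values are hypothetical, the three inputs are assumed to have been pulled from the record already, and the diacritic stripping assumes MARC-8 byte strings as discussed in the replies.

```perl
use strict;
use warnings;

# Hypothetical helper: build a "name,title/date" search key from values
# that were already extracted from a record (extraction is not shown).
sub search_key {
    my ($name, $title, $date) = @_;
    for ($name, $title) {
        $_ = '' unless defined $_;   # tolerate a missing 100 or 245
        s/[\xE0-\xFE]//g;            # drop MARC-8 combining diacritic bytes
    }
    $date = '' unless defined $date;
    return substr($name, 0, 4) . ',' . substr($title, 0, 4) . '/' . $date;
}

# Made-up sample values chosen to reproduce the key format in the message.
print search_key("Brahma", "vasana", "2003"), "\n";
```

A missing name yields a key that starts with a comma, as in one of the sample lines quoted elsewhere in the thread; a caller may prefer to skip such records.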