Hello,
The way the 4,4 keys usually work (in our case at least)
is that it is the first four characters of the first _word_
of the author and title. So your code will need to take into
account titles and names with first words less than 4
characters long. For instance, the work "Foo Bar" by
John Do
-8566
-Original Message-
From: Doran, Michael D [mailto:[EMAIL PROTECTED]
Sent: Tuesday, January 11, 2005 3:13 PM
To: perl4lib@perl.org
Subject: RE: Ignoring Diacritics accessing Fixed Field Data
Hi Jane,
These answers assume that the data you are processing:
1) is encoded in the MARC
an, Michael D
> Sent: Tuesday, January 11, 2005 2:13 PM
> To: perl4lib@perl.org
> Subject: RE: Ignoring Diacritics accessing Fixed Field Data
>
> Hi Jane,
>
> These answers assume that the data you are processing:
> 1) is encoded in the MARC-8 character set, and
>
On Tuesday, January 11, 2005 2:13 PM, Michael Doran wrote:
>Assuming that you asking how to strip out the MARC-8 combining diacritic
characters, try inserting the substitution >commands listed (as shown below)
just prior to the substr commands:
>> my $ME = $field->subfield('a');
>
Hi Jane,
These answers assume that the data you are processing:
1) is encoded in the MARC-8 character set, and
2) consists of the MARC-8 default basic and extended Latin characters.
> Dave,Ayod\2003
> Paòt,Kaâs\2002
> Baks,Dasa\2003
> ,Viâs\2002
>
> Problem 1: As you can see, I don't really want
> Problem 1: As you can see, I don't really want the first four
> characters, I want the first four SEARCHABLE characters. How can I
> tell MARC Record to give me the first four characters, excluding
> diacritics?
What output would you have rather seen?
Dave,Ayod\2003
Paot, Kaas\2002
Hi Jane:
On Tue, Jan 11, 2005 at 01:29:55PM -0500, Jacobs, Jane W wrote:
> My result was something like:
>
> Dave,Ayod\2003
> Paòt,Kaâs\2002
> Baks,Dasa\2003
> ,Viâs\2002
>
> Problem 1: As you can see, I don't really want the first four characters, I
> want the first four SEARCHABLE characters.