Can someone suggest a way to identify if a MARC record, coded at LDR/09 = ‘a’
has non-unicode characters in it? I tried the following, kind of grasping at
straws, against a record that I know has non-unicode characters. It didn’t
report any errors.
# $bib_id is defined as 001 field
my $bib_marc = [subroutine defined elsewhere to get a marc record
string];
eval {
$bib_rec = MARC::Record->new_from_usmarc($bib_marc);
} ;
if ($@) {
print ERRORS "$bib_id\t$@\n";
next;
}
We have a group of records in our database that are mostly Unicode but have
some erroneous characters. I’d like to have a script to run against them to see
if they’ve been completely cleaned up after the catalogers work on them.
Anne L. Highsmith
Director of Consortia Systems
Texas A&M University
5000 TAMU
College Station, TX 77843-5000
Phone: 979 862 4234
Fax: 979 845 6238
Email: [email protected]