Dumb question about utf-8 handling

Gene Damon Thu, 15 Mar 2007 12:29:46 -0800

Been using marc.pm for some time to do a "bunch-o-stuff," but haven'thad to deal with unicode problems until now. Couple of basic questions.

1. If I use the "$record->encoding()" call, is it simply checking forthe "a" in leader position 9? That is, it is not doing anything morecomplicated, like checking the actual data.

(Reason for the question, if leader position 9 is wrong, and encodingdepends on it, then encoding will not tell me anything more than marc.pmthinks that it is unicode.)

2. If I want to update a record that is unicode do I have to make sureany text string that I send in say, append_fields(... has been encodedas utf8, i.e., use some perl function to ensure that the string(s) areutf8? Alternatively, does marc.pm handle that automatically when itthinks it has a utf8 record?

(This problem comes from working with a set of records. When I updatethose that have leader byte 9 set to "a," and I try to do something withthe updated record, e.g., read it, I crash with the "does not map to..." message. That leads me to assume that either I need to dosomething with the stuff I am adding to make it utf8 or I have recordsthat say they are utf8 and are not. Running some of the records througha little check program I have, usually finds a problem with the recordlength when it hits these records that were changed.)


--
Gene Damon
Director of Library Automation &
Learning Resources
Virginia Community College System
(804) 819-4981 (voice)
(804) 819-4773 (fax)
[EMAIL PROTECTED]
Address:
James Monroe Bldg.
15th Floor
101 N. 14th St.
Richmond, VA 23219

http://library.vccs.edu/reference

Dumb question about utf-8 handling

Reply via email to