Re: MARC::Record and UTF-8 & related threads

Ed Summers Mon, 07 Mar 2005 06:37:34 -0800

On Fri, Mar 04, 2005 at 09:18:00AM -0500, Anne L. Highsmith wrote:
> Here's my main question -- is that the principal
> concern/question/problem, i.e. that directory lengths will not be
> computed correctly using the existing MARC::Record module with a
> Unicode record? Or is it only in certain situations that the directory
> length would not be computed correctly?


Yes, but only if the record actually contains unicode :) If you are
looking for an example of how MARC::Record breaks when there is utf8 
in the record you can look at t/utf8.t which is a test distributed with
the MARC-Record package. Currently, this test is skipped because otherwise 
it would fail.

> If anyone is inspired to make the necessary updates to the MARC::Record 
> module to handle unicode records, I'd certainly be happy to test. I'd also be 
> eternally grateful, since my alternative might be re-writing 8 or 10 job 
> streams in the next 10 weeks so that I can: 1) export the records from my 
> database in MARC8; 2) edit them; 3) reload them doing a MARC8-Unicode 
> conversion utility provided by the lms vendor.

I've been meaning to write to the list about this for sometime now. How
would people feel about the next version of MARC-Record (perhaps a
v2.0) which handled utf8 properly and required a modern perl? By modern
perl I mean a version >= 5.8.1. The reason why 5.8.1 is required is that
it's the first perl with a byte oriented substr() (available via the
bytes pragma).

Perhaps if people could respond to the list (or me if you prefer) with
the version of Perl that you use MARC::Record with I could keep tallies
and report back to the list.

//Ed

Re: MARC::Record and UTF-8 & related threads

Reply via email to