Re: Invalid UTF-8 characters causing MARC::Record crash.

2011-05-17 Thread Mike Barrett
Al,
Your code worked like a charm. Ran the entire test data set (about 1,000 bibs) w/out a problem.

Dave,
Being a Perl novice, I decided to forgo LocalOverride. Your description scared me off. :) I'll make that an adventure for another day.

Now I can start on my real project: Mining bib reco

Re: OAI::Harvester installation help

2011-05-17 Thread Thomas Krichel
Dave Sherohman writes
> Hey, all! Long-time Perl programmer, but new to the world of libraries,
> so I'm not all that familiar with all the data formats used in these
> parts.
>
> I am attempting to use some code which depends on Net::OAI::Harvester,
> but my attempts to install OAI::Harvester

Re: Invalid UTF-8 characters causing MARC::Record crash.

2011-05-17 Thread Dave Sherohman
On Tue, 2011-05-17 at 07:27 -0700, Al wrote:
> For me I've found the best solution is to leave Encode.pm
> alone and redefine the offending subroutine within my processing script. I
> paste this in at the bottom of every script:

I always feel a little funny about promoting my own modules, but th

Re: Invalid UTF-8 characters causing MARC::Record crash.

2011-05-17 Thread Al
> Anybody ever see this before?

All. The. Time. When I use Encode.pm version 2.12 I don't have this problem. But it occurs repeatedly with version 2.40. There are a few different solutions, but I'm assuming, like me, that it's not practical for you to clean up your MARC records *before* you t

Re: Invalid UTF-8 characters causing MARC::Record crash.

2011-05-17 Thread Ashley Sanders
Hi,
> I'm using MARC::Batch and MARC::Field to iterate through a text file of
> bibliographic records from Voyager.
>
> The unrecoverable error is actually occurring in the Perl Unicode module
> which is, of course, called by MARC::Record.
> It's running into "invalid UTF-8 character 0xC2."
> Whe

Invalid UTF-8 characters causing MARC::Record crash.

2011-05-17 Thread Mike Barrett
I'm using MARC::Batch and MARC::Field to iterate through a text file of bibliographic records from Voyager. The unrecoverable error is actually occurring in the Perl Unicode module which is, of course, called by MARC::Record. It's running into "invalid UTF-8 character 0xC2." When I looked up the U