RE: Splitting a large file of MARC records into smaller files

2010-01-25 Thread Houghton,Andrew
> From: Nolte, Jennifer [mailto:jennifer.no...@yale.edu] > Sent: Monday, January 25, 2010 09:48 AM > To: perl4lib@perl.org > Subject: Splitting a large file of MARC records into smaller files > > Hello- > > I am working with files of MARC records that are over a million records > each. I'd like t

RE: How to convert from ANSEL/MARC-8 to UTF-8?

2009-01-07 Thread Houghton,Andrew
> From: Galen Charlton [mailto:galen.charl...@liblime.com] > Sent: Wednesday, January 07, 2009 11:47 AM > To: Michael Lackhoff > Cc: perl4lib@perl.org > Subject: Re: How to convert from ANSEL/MARC-8 to UTF-8? > > On Wed, Jan 7, 2009 at 11:42 AM, Michael Lackhoff > wrote: > > diakritics + base cha

RE: Excel to MARC

2008-11-12 Thread Houghton,Andrew
> From: Yan Liao [mailto:[EMAIL PROTECTED] > Sent: Wednesday, November 12, 2008 12:33 PM > To: perl4lib@perl.org > Subject: Excel to MARC > > Does any one know how to trans data from Excel to MARC? There is a rush > local project and I don't have time to review my PERL class notes. Are you lookin

RE: Working around a UTF8/Unicode encoding problem

2007-05-14 Thread Houghton,Andrew
> From: Jason Ronallo [mailto:[EMAIL PROTECTED] > Sent: 12 May, 2007 16:52 > To: William Denton > Cc: perl4lib@perl.org > Subject: Re: Working around a UTF8/Unicode encoding problem > > I can also see that this record is broken > because the XML entity ' is in a MARC communications format file.

RE: MARC Records, XML, and encoding

2006-05-19 Thread Houghton,Andrew
> From: Joshua Ferraro [mailto:[EMAIL PROTECTED] > Sent: 19 May, 2006 13:40 > To: Edward Summers > Cc: perl4lib > Subject: Re: MARC Records, XML, and encoding > > Hi all, > > Here is an OCLC record: > > http://liblime.com/public/oclc1.dat > > So ... any suggestions for tracking down this probl

RE: yet another character encoding question

2005-09-29 Thread Houghton,Andrew
> -Original Message- > From: Thomale, J [mailto:[EMAIL PROTECTED] > Sent: 29 September, 2005 11:05 > To: perl4lib@perl.org > Subject: RE: yet another character encoding question > > Right, that was my plan. Since latin-1 to UTF-8 isn't > difficult to do (using utf8::encode()), I figured

RE: Use of encode([$encoding]) in MARC-XML

2005-09-27 Thread Houghton,Andrew
> -Original Message- > From: Edward Summers [mailto:[EMAIL PROTECTED] > Sent: 27 September, 2005 10:36 > To: perl4lib@perl.org > Subject: Re: Use of encode([$encoding]) in MARC-XML > > On Sep 27, 2005, at 7:29 AM, Sperr, Edwin wrote: > > I'm attempting to use XSL (on a Windows server) to

RE: utf8 to marc8 conversion

2005-09-15 Thread Houghton,Andrew
> -Original Message- > From: Huw Jones [mailto:[EMAIL PROTECTED] > Sent: Thursday, September 15, 2005 6:40 AM > To: perl4lib > Subject: utf8 to marc8 conversion > > Dear all, > > Is there a way of converting marc files from utf8 to Marc 8 > in perl? We are trying to automate the extract

RE: Corrupt MARC records

2005-05-07 Thread Houghton,Andrew
Most MARC utilities like MARC::Record depend upon the actual directory lengths and having well formed structure. Isn't that what standards are for? But sometimes you really do get badly formed MARC records and need to recover the data. The presented code does have two caveats, which I point

RE: Corrupt MARC records

2005-05-07 Thread Houghton,Andrew
e top of my head and not tested, didn't I? It's still not test, but the above mistakes were obvious after reading what I sent... Andy. -Original Message- From: Houghton,Andrew [mailto:[EMAIL PROTECTED] Sent: Saturday, May 07, 2005 10:58 AM To: perl4lib@perl.org Subject: RE: Cor

RE: Corrupt MARC records

2005-05-07 Thread Houghton,Andrew
MARC records contain a field delimiter after each field and a record delimiter at the end. Assuming that those delimiters are still in your MARC records and that the directory entries are in the same order as the fields, then you can do the following: 1 Set Perl's record delimiter to the MAR

RE: French translation for MARC tag descriptions

2005-04-25 Thread Houghton,Andrew
I should point out that since Canada is using MARC-21, it is possible that Library Archives Canada might have the same information translated into French. So take a look at their site. Andy. > -Original Message- > From: Houghton,Andrew [mailto:[EMAIL PROTECTED] > Sent:

RE: French translation for MARC tag descriptions

2005-04-25 Thread Houghton,Andrew
The unavailable French translation will be at: http://www.loc.gov/marc/marcfre.html The completed Spanish translation is at: http://www.loc.gov/marc/marcspa.html Andy. > -Original Message- > From: Christensen, David A. (CHT) [mailto:[EMAIL PROTECTED] > Sent: 25 April, 2005 11:45 > T

RE: MARC::Record and UTF-8

2005-01-07 Thread Houghton,Andrew
> From: Ed Summers [mailto:[EMAIL PROTECTED] > Sent: 07 January, 2005 09:56 > To: perl4lib@perl.org > Subject: Re: MARC::Record and UTF-8 > > On Fri, Jan 07, 2005 at 08:13:08AM -0500, Houghton,Andrew wrote: > > This is not a Perl solution, but if you are just looki

RE: MARC::Record and UTF-8

2005-01-07 Thread Houghton,Andrew
>From: Ron Davies [mailto:[EMAIL PROTECTED] >Sent: Friday, January 07, 2005 2:54 AM >Subject: Re: MARC::Record and UTF-8 > >At 07:50 7/01/2005, [EMAIL PROTECTED] wrote: >>Does anyone know of any work underway to adapt MARC::Record for utf-8 >>encoding ? > >I will have a similar project in a few m

RE: Warnings during decode() of raw MARC

2004-08-18 Thread Houghton,Andrew
> From: Bryan Baldus [mailto:[EMAIL PROTECTED] > Sent: 18 August, 2004 09:24 > Subject: Warnings during decode() of raw MARC > > I'm probably missing something obvious, but I have been > unsuccessful in trying to capture the warnings reported by > MARC::Record that are set by MARC::File::USMAR

RE: Filing-rules sort subroutine for authors' names?

2004-07-26 Thread Houghton,Andrew
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] > Sent: 26 July, 2004 13:58 > Subject: Re: Filing-rules sort subroutine for authors' names? > > Definitely possible--library automation systems and card > printing systems do it. > > I'm not fully conversant with the rules myself, but can te

RE: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Houghton,Andrew
> From: Paul Hoffman [mailto:[EMAIL PROTECTED] > Sent: 01 July, 2004 11:57 > Subject: Re: Displaying diacritics in a terminal vs. a browser > > Unless I'm very much mistaken, Chris's code is outputting > UTF-8 to the terminal, not MARC-8. > >> From: Christopher Morgan [mailto:[EMAIL PROTECTED]

RE: Displaying diacritics in a terminal vs. a browser

2004-07-01 Thread Houghton,Andrew
> From: Christopher Morgan [mailto:[EMAIL PROTECTED] > Sent: 01 July, 2004 10:50 > Subject: Displaying diacritics in a terminal vs. a browser > > I use the $cs->to_utf8 conversion from MARC::Charset to > display MARC Authority records in a browser, and the > diacritics display properly there. >

RE: NACO Normalization and Text::Normalize

2003-08-27 Thread Houghton,Andrew
From: Brian Cassidy [mailto:[EMAIL PROTECTED] Subject: RE: NACO Normalization and Text::Normalize > * normalize() > > inputs: either a MARC::Record object or a string. This should probably > accept an arbitrary number of inputs so, you can do > * compare() > > inputs: either two M::R objects or t