MARC::Record 2.0.7 and MARC::File::XML 1.0.5 released

2017-05-23 Thread Galen Charlton
Hi, I have uploaded MARC::Record 2.0.7 and MARC::File::XML 1.0.5 to CPAN. Both are minor bugfix and packaging update releases. Here are the changes to MARC::Record: 2.0.7 Tue May 23 20:41:13 EDT 2017 [FIXES] - RT#108123: clean up MANIFEST.SKIP - GH#1: marcdump now prints

Σχετ: MARC::File::XML 1.0.3 released

2014-01-29 Thread dasos ili
greetings to you all, could you please give me a hint on how you treat some long waiting processes? For example we are trying to upload a MARC file and then index it via a web interface, but the whole procedure takes a lot, or we have no idea via dancer front end how is it going, how could we b

MARC::File::XML 1.0.3 released

2014-01-22 Thread Galen Charlton
Hi, I have uploaded MARC::File::XML 1.0.3 to CPAN. This release fixes a test failure on Strawberry Perl. Here is the changelog: 1.0.3 Thu Jan 23 05:02:52 UTC 2014 - fix t/external-entities.t to pass on Strawberry Regards, Galen -- Galen Charlton gmcha...@gmail.com

Re: SECURITY release: MARC::File::XML 1.0.2

2014-01-22 Thread Dan Scott
On Tue, Jan 21, 2014 at 12:38 PM, Galen Charlton wrote: > Hi, > > I have uploaded [1] version 1.0.2 of MARC::File::XML. This is a > security release that repairs an XML external entity (XXE) > vulnerability. I recommend that all uses of MARC::File::XML upgrade > promptly. >

SECURITY release: MARC::File::XML 1.0.2

2014-01-21 Thread Galen Charlton
Hi, I have uploaded [1] version 1.0.2 of MARC::File::XML. This is a security release that repairs an XML external entity (XXE) vulnerability. I recommend that all uses of MARC::File::XML upgrade promptly. Here is the change log entry: 1.0.2 Tue Jan 21 17:18:37 UTC 2014 - MARC::File

MARC::File::XML 1.0.1

2013-02-15 Thread Galen Charlton
that there's a fair amount of clients of MARC::File::XML that depended on an empty MARC::Record being returned when given an empty record element. If your latest package is of 0.93, you may want to hold off for a bit, as I have another release planned that will fix a long-standing bug

MARC::File::XML 1.0

2013-02-10 Thread Galen Charlton
Hi, I have uploaded version 1.0 of MARC::File::XML to CPAN. Changes from version 0.93 are: 1.0 Sun Feb 10 21:32:32 PST 2013 - Use XML::LibXML rather than XML::SAX to parse MARCXML records. This offers a significant speed improvement for record parsing; in some tests

RE: MARC::Record / MARC::File::XML bug when fields contain newlines?

2012-01-12 Thread Jacobs, Jane W
; } If it works for you, great! And thanks to the original source, who I would acknowledge, if I knew who it was. JJ -Original Message- From: Reese, Terry [mailto:terry.re...@oregonstate.edu] Sent: Thursday, January 12, 2012 12:03 PM To: perl4lib@perl.org Subject: MARC::Record / MAR

RE: MARC::Record / MARC::File::XML bug when fields contain newlines?

2012-01-12 Thread Bryan Baldus
date them with MARCEdit. ... >Looks like MARC::Record is doing its job correctly. Perhaps changing >MARC::File::XML is in order. MARC::File::USMARC includes a line in sub _next: # remove illegal garbage that sometimes occurs between records $usmarc =~ s/^[ \x00\x0a\x0d\x1a]+//; If I r

Re: MARC::Record / MARC::File::XML bug when fields contain newlines?

2012-01-12 Thread arvinport...@lycos.com
nd 0D as valid. However I hunted down a more official version of the map here: http://lcweb2.loc.gov/diglib/codetables/42.html And sure enough neither 0A or 0D is listed. Learn something new and all that. Looks like MARC::Record is doing its job correctly. Perhaps changing MARC::File::XML

MARC::Record / MARC::File::XML bug when fields contain newlines?

2012-01-12 Thread Reese, Terry
AM To: perl4lib@perl.org Subject: perl4lib Digest 12 Jan 2012 16:31:25 - Issue 479 perl4lib Digest 12 Jan 2012 16:31:25 - Issue 479 Topics (messages 2952 through 2953): MARC::Record / MARC::File::XML bug when fields contain newlines? 2952 by: arvinporthog.lycos.com 2953 by

Re: MARC::Record / MARC::File::XML bug when fields contain newlines?

2012-01-12 Thread Jon Gorman
On Wed, Jan 11, 2012 at 3:35 PM, arvinport...@lycos.com wrote: > I've been converting MARC XML records into USMARC and recently had a slew of > bad records which MARCEdit reported as having invalid leaders. After a few > days of puzzling over this and blaming it all on Unicode I noticed they wer

MARC::Record / MARC::File::XML bug when fields contain newlines?

2012-01-11 Thread arvinport...@lycos.com
MARC21/slim";> 06965nam a2202005 u 4500 Theoretical and Technological Aspects of Crystal Growth (If your mail viewer mangles lines, there's a hard return (0D 0A) after the word Technological in the 245) Here is my test program which illustrates the pro

Fixing MARC::File::XML parser failure

2011-11-29 Thread Xan Charbonnet
), then I can never use new_from_xml again in that run of my script. Instead I get a "parser is already parsing" error. I've made a small change to MARC::File::XML to work around this. In the "decode" function, instead of: $parser->parse_string( $text ); I now re-creat

Re: MARC::File::XML utf8 output problem

2011-03-08 Thread Saiful Amin
> > However, when I try to generate MARCXML output with following code the >> > Arabic characters gets corrupted: >> >> > $xml = MARC::File::XML->out( $file ); >> > my $record = MARC::Record->new(); >> > ... >> > $xml->write( $rec

Re: MARC::File::XML utf8 output problem

2011-03-08 Thread Saiful Amin
> > However, when I try to generate MARCXML output with following code the > > Arabic characters gets corrupted: > > > $xml = MARC::File::XML->out( $file ); > > my $record = MARC::Record->new(); > > ... > > $xml->write( $record ); > > ... >

Re: MARC::File::XML utf8 output problem

2011-03-08 Thread Frédéric DEMIANS
> However, when I try to generate MARCXML output with following code the > Arabic characters gets corrupted: > $xml = MARC::File::XML->out( $file ); > my $record = MARC::Record->new(); > ... > $xml->write( $record ); > ... > $xml->close(); > I'v

MARC::File::XML utf8 output problem

2011-03-08 Thread Saiful Amin
Hi, I'm trying to create MARCXML records with data extracted from MySQL, in which some of the records are in Arabic. I'm using MARC::Record (2.0.3), MARC::File::XML (0.93) and DBD::mysql (4.012). The output into ISO2709 shows the Arabic utf8 strings correctly with following code snip

MARC::File::XML 0.93

2011-02-11 Thread Galen Charlton
Hi, I have uploaded version 0.93 of MARC::File::XML to CPAN. This is a minor bugfix release: 0.93 Fri Feb 11 17:13:02 EST 2011 - When slurping MARCXML records (e.g., via MARC::Batch), can now handle XML files that use a prefix to refer to the http://www.loc.gov/MARC21

Re: Problem with MARC::File::XML

2010-12-20 Thread Claire Hernandez
Hi, First, when the patch is not applied, in both case, I have no error. There are 2 cases: 1/ $ more /etc/perl/XML/SAX/ParserDetails.ini [XML::SAX::PurePerl] http://xml.org/sax/features/namespaces = 1 [XML::LibXML::SAX] http://xml.org/sax/features/namespaces = 1 [XML::LibXML::SAX::Parser] http

Re: Problem with MARC::File::XML

2010-12-15 Thread Galen Charlton
Hi, On Tue, Dec 14, 2010 at 11:55 AM, Claire Hernandez wrote: > I had another issue, with another sax parser: >    Malformed UTF-8 character (unexpected end of string) in substitution > (s///) at /usr/lib/perl5/XML/LibXML/Error.pm line 217. >    problem with :84 : :61: parser error : Input is not

Re: Problem with MARC::File::XML

2010-12-14 Thread Claire Hernandez
ire; From: *Galen Charlton* Date: December 13, 2010 06:16 Subject: Re: Problem with MARC::File::XML Message ID: aanlktinutec98fzq4dzz=t6hlvvyqe0nq4=uzepqp...@mail.gmail.com <http://www.nntp.perl.org/group/perl.perl4lib/;msgid=aanlktinutec98fzq4dzz=t6hlvvyqe0nq4=uzepqp...@mail.gmail.com> Hi, On

Re: Problem with MARC::File::XML

2010-12-13 Thread Galen Charlton
Hi, On Thu, Dec 2, 2010 at 5:09 AM, LAURENT Henri-Damien wrote: > we are currently trying to use MARC::File::XML on multi threaded > decoding of records. > We are encountering erratic error of Parsing. >  problem with :25 : > not well-formed (invalid token) at line 32, column 2

Problem with MARC::File::XML

2010-12-02 Thread LAURENT Henri-Damien
Hi we are currently trying to use MARC::File::XML on multi threaded decoding of records. We are encountering erratic error of Parsing. problem with :25 : not well-formed (invalid token) at line 32, column 27, byte 1216 at /usr/lib/perl5/XML/Parser.pm line 187 http://www.w3.org/2001/XMLSchema

Re: DOM parser for MARC::File::XML

2010-11-21 Thread Frédéric DEMIANS
Any testing would be gratefully appreciated. I've run some tests. Parsing 1000 MARCXML records gives me this result: - with the current SAX handler: 8.21954107284546 - with new DOM method: 6.30712199211121 Thanks. -- Frédéric

DOM parser for MARC::File::XML

2010-11-14 Thread Galen Charlton
ementation even when using XML::ExpatNS as the SAX driver (details of the testing can be found in two emails to koha-devel [2], [3]). Since MARC::File::XML is presently geared towards parsing a single record at a time, memory consumption shouldn't be much of an issue. Any testing woul

Re: Moving to Google Code/svn (was Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2010-03-22 Thread Galen Charlton
Hi, On Tue, Mar 16, 2010 at 11:13 PM, Galen Charlton wrote: > If there are no major objections, in a week's time I plan to make the > CVS repo read-only and we'll move forward with Git. I've now removed write access to the CVS repository, and will disable CVS entirely once I see that people are

Re: Moving to Google Code/svn (was Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2010-03-16 Thread Ed Summers
On Tue, Mar 16, 2010 at 11:13 PM, Galen Charlton wrote: > If there are no major objections, in a week's time I plan to make the > CVS repo read-only and we'll move forward with Git. Hooray, thanks so much Galen! Sounds like a great plan moving forward. //Ed

Re: Moving to Google Code/svn (was Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2010-03-16 Thread Galen Charlton
Hi, On Tue, Jan 26, 2010 at 11:57 AM, Ed Summers wrote: > I would argue for keeping stuff on sf.net ... and leapfrogging svn for > git if we can. To that end, I've enabled Git in the marcpm SourceForge project and established a new repository which contains the history from the five CVS modules

Re: Moving to Google Code/svn (was Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2010-01-26 Thread Galen Charlton
Hi, On Tue, Jan 26, 2010 at 10:57 AM, Ed Summers wrote: > I would argue for keeping stuff on sf.net ... and leapfrogging svn for > git if we can. Both suggestions would be perfectly fine with me. Bryan, for your reference, TortoiseGit [1] seems to be a well-supported Windows client for Git. [1

Re: Moving to Google Code/svn (was Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2010-01-26 Thread Ed Summers
I would argue for keeping stuff on sf.net ... and leapfrogging svn for git if we can. //Ed On Tue, Jan 26, 2010 at 10:45 AM, Galen Charlton wrote: > Hi, > > On Wed, Jul 22, 2009 at 7:53 PM, Dueber, William wrote: >> Speaking for myself only, of course: Please, for the love of god, move away >>

Moving to Google Code/svn (was Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2010-01-26 Thread Galen Charlton
Hi, On Wed, Jul 22, 2009 at 7:53 PM, Dueber, William wrote: > Speaking for myself only, of course: Please, for the love of god, move away > from CVS. Moving the code to a more modern VCS and a well-supported host. The > least-painful move might be to SVN on Google Code. Picking up on this thre

MARC::File::XML 0.92 released

2009-07-31 Thread Galen Charlton
Hi, I've uploaded 0.92 of MARC::File::XML to CPAN. The only changes since 0.91 are to POD and packaging, including adding back a copyright statement in the README to satisfy Debian requirements. From the change log: 0.92 Thu Jul 30 22:37:07 EDT 2009 - small documentation changes to

MARC::File::XML 0.91 released

2009-07-30 Thread Galen Charlton
Hi, I have uploaded version 0.91 of MARC::File::XML to CPAN. This release also includes changes to its sibling module MARC:File::SAX. Changes from 0.88 include: 0.91 Tue Jul 28 2009 - Nixing the stray space (always been there) in the schemaLocation attribute for single-record

Re: Ready for MARC::File::XML release? (was [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2009-07-26 Thread Ed Summers
; > Any objections to my pushing out 0.91 as a bugfix release?  I've > applied Bill's patch and addressed the one CPAN bug against 0.88.  Any > more patches still sitting on the floor? > > While I intend to go over MARC::File::XML with a fine-toothed comb > some time in the

Ready for MARC::File::XML release? (was [Patch] Escape marc tag/code/indicators in Marc::File::XML)

2009-07-26 Thread Galen Charlton
bug against 0.88. Any more patches still sitting on the floor? While I intend to go over MARC::File::XML with a fine-toothed comb some time in the next few weeks, I figure that that the results will be better targeted to a 1.0 release. Regards, Galen

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-07-26 Thread Galen Charlton
Hi, On Wed, Jul 22, 2009 at 5:04 PM, Dan Scott wrote: > Was it decided that Bill's escaping output patch would be dropped? I > don't see it in CVS yet. I've committed Bill's patch and a test case based on the sample file he sent. Thanks, Bill! Regards, Galen

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-07-22 Thread Dan Scott
2009/7/22 Galen Charlton : > Hi, > > Nope, just haven't gotten around to pushing it.  Soon, I promise! > > Funny you should mention CVS.  I have a general question for the > MARC/Perl hackers: Ed mentioned a while back moving from CVS to a more > modern VCS such as Subversion or (my preference) Git

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-07-22 Thread Dueber, William
Speaking for myself only, of course: Please, for the love of god, move away from CVS. Moving the code to a more modern VCS and a well-supported host. The least-painful move might be to SVN on Google Code. I'm pretty sure there's nothing but CVS available for OS9, though, so the question has to

RE: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-07-22 Thread Bryan Baldus
On Wednesday, July 22, 2009 4:10 PM, Galen Charlton [galen.charl...@liblime.com] wrote: >Funny you should mention CVS. I have a general question for the MARC/Perl hackers: Ed mentioned a while back moving from CVS to a more modern VCS such as Subversion or (my preference) Git. I'm willing to do

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-07-22 Thread Galen Charlton
Hi, Nope, just haven't gotten around to pushing it. Soon, I promise! Funny you should mention CVS. I have a general question for the MARC/Perl hackers: Ed mentioned a while back moving from CVS to a more modern VCS such as Subversion or (my preference) Git. I'm willing to do the legwork to get

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-07-22 Thread Dan Scott
Was it decided that Bill's escaping output patch would be dropped? I don't see it in CVS yet. It would be nice to see the 0.91 release get pushed out the door, in any case. 0.88 was a long time ago. Dan 2009/4/14 Galen Charlton : > Hi, > > On Tue, Apr 14, 2009 at 3:57 PM, Dan Scott wrote: >> 20

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-04-14 Thread Galen Charlton
Hi, On Tue, Apr 14, 2009 at 3:57 PM, Dan Scott wrote: > 2008/10/29 Bill Erickson : > Is anyone planning on applying this patch? It would be a shame to drop > it on the floor. I'll take a look at it and apply it, unless somebody beats me to the punch in the next day or two. Regards, Galen -- G

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2009-04-14 Thread Dan Scott
2008/10/29 Bill Erickson : > Hi all, > > I ran across some gnarly MARC data today, which contained, among other > things, MARC codes of "<".  I realized that Marc::File::XML outputs the MARC > tags, codes, and indicators without escaping them.  This results, in my

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2008-10-29 Thread Bill Erickson
On Wed, Oct 29, 2008 at 5:01 PM, Dan Scott <[EMAIL PROTECTED]> wrote: > 2008/10/29 Bill Erickson <[EMAIL PROTECTED]>: > > Hi all, > > > > I ran across some gnarly MARC data today, which contained, among other > > things, MARC codes of "<".

Re: [Patch] Escape marc tag/code/indicators in Marc::File::XML

2008-10-29 Thread Dan Scott
2008/10/29 Bill Erickson <[EMAIL PROTECTED]>: > Hi all, > > I ran across some gnarly MARC data today, which contained, among other > things, MARC codes of "<". I realized that Marc::File::XML outputs the MARC > tags, codes, and indicators without escaping the

[Patch] Escape marc tag/code/indicators in Marc::File::XML

2008-10-29 Thread Bill Erickson
Hi all, I ran across some gnarly MARC data today, which contained, among other things, MARC codes of "<". I realized that Marc::File::XML outputs the MARC tags, codes, and indicators without escaping them. This results, in my case, in invalid XML like: France It seems re

Requesting help with simple MARC::File::XML program -- SOLVED

2008-07-29 Thread Anne L. Highsmith
Thanks to suggestions from Michael Kreyche & Bill Dueber, I have been able to solve the problem with using the "file" command on my usmarc to marcxml program. It was the perl version. We were running on 5.8.7, which threw the error. Once we upgraded to 5.8.8, program ran just fine.

Requesting help with simple MARC::File::XML program

2008-07-20 Thread Anne L. Highsmith
I have a file of usmarc records which I want to read into a program and print to a file as MARC xml. Here's my program so far: # #!/usr/local/bin/perl use strict; use warnings; use MARC::Record; use MARC::Batch; use MARC::File::XML

Re: problem with MARC::File::XML on RH 5 64bits [solved]

2007-12-18 Thread paul POULAIN
Ed Summers a écrit : It looks like you don't have an XML parser installed that supports the features that M::F::X requires: use XML::SAX qw(Namespaces Validation); Try executing that, and see if you get a similar exception. FWIW Namespace support is required for the version of MARC::File::SAX

Re: problem with MARC::File::XML on RH 5 64bits

2007-12-17 Thread Ed Summers
It looks like you don't have an XML parser installed that supports the features that M::F::X requires: use XML::SAX qw(Namespaces Validation); Try executing that, and see if you get a similar exception. FWIW Namespace support is required for the version of MARC::File::SAX that is in CVS since i

problem with MARC::File::XML on RH 5 64bits

2007-12-17 Thread paul POULAIN
ems I have a problem with M::F::X : [EMAIL PROTECTED] MARC-XML-0.88]# make test PERL_DL_NONLAZY=1 /usr/bin/perl "-MExtUtils::Command::MM" "-e" "test_harness(0, 'blib/lib', 'blib/arch')" t/*.t t/batch # Failed test 'use MARC

MARC::File::XML 0.88

2007-11-29 Thread Joshua Ferraro
Hi all, Just a quick announcement regarding an update of MARC::File::XML to version 0.88. From the Changelog: 0.88 Wed Nov 28 2007 - String test for subfield code to avoid dropping $0 (Galen Charlton) 0.88_1 Tue Oct 23 2007 - Fixed a typo in M::F::X that could be the origin of the

Re: MARC::File::XML and parsing.

2007-10-19 Thread Mike Rylander
o exactly what you want. You can either pass 'use' flags to M::F::X to set the default: use MARC::File::XML ( BinaryEncoding => 'utf8', RecordFormat => 'UNIMARC' ); or, if you're reading the XML by hand one record at a time, you can use new_from_xml (after

Re: MARC::File::XML and parsing.

2007-10-03 Thread Henri-Damien LAURENT
Doran, Michael D a écrit : > Hi Henri, > > >> Is there a reason why MARC::File::XML considers only a very >> strict subset of utf-8 as valid ? >> > > I would guess that it has to do with adhering to the MARC-21 repertoire of > characters, so as to

RE: MARC::File::XML and parsing.

2007-09-27 Thread Doran, Michael D
Hi Henri, > Is there a reason why MARC::File::XML considers only a very > strict subset of utf-8 as valid ? I would guess that it has to do with adhering to the MARC-21 repertoire of characters, so as to facilitate the round-trip conversion between the MARC-8 and Unicode character set

MARC::File::XML odd conversion to usmarc

2007-09-26 Thread Henri-Damien LAURENT
part of a batch. And it seems that trying to decode the marc record on its own leads to problems. Anyway, why is there a change in leader from 00154nx a2200073 4500 to 00162lbána2200073uel4500 ? This leads to problem with MARC decoding. Is there something to do about it ? Coud MARC::File::

MARC::File::XML and parsing.

2007-09-26 Thread Henri-Damien LAURENT
hi, I have some problems with Marc::File::XML parser. Take those two xml records. Despite the fact that I agree that there are odd characters in some subfields. I am wondering why, since those characters are UTF8, MARC::File::XML should drop them when parsing. Is there a reason why MARC::File

Re: wide character in print in MARC::File::XML

2007-09-18 Thread Mike Rylander
On 8/19/07, Rina Ron <[EMAIL PROTECTED]> wrote: > I got the error "wide character in print at IO/Handle.pm line 399" and > solved it in a not nice way by using an object as a hash reference. > Perhaps there is a better way. I have updated CVS with code that checks the default output encoding and s

MARC::File::XML UNIMARC Encoding Problem

2007-08-26 Thread Henri-Damien LAURENT
Hi, I have a problem with new_from_xml. Here is my script : my $record; print $marcxml; $record= MARC::Record->new_from_xml($marcxml,"UTF8","UNIMARC"); warn $record->as_xml; Here is my data : http://www.w3.org/2001/XMLSchema-instance"; xsi:schemaL

wide character in print in MARC::File::XML

2007-08-18 Thread Rina Ron
I got the error "wide character in print at IO/Handle.pm line 399" and solved it in a not nice way by using an object as a hash reference. Perhaps there is a better way. Here is what I have : use MARC::File::XML ( BinaryEncoding => 'utf8', RecordFormat => 'UNI

MARC::File::XML

2007-07-16 Thread Shieh, Jackie
I was wondering why MARC::File::XML did not work with #2 file?! 1: a single record: http://www.loc.gov/standards/marcxml/Sandburg/sandburg.xml 2: a collection set: http://www.loc.gov/standards/marcxml/xml/collection.xml The differences I noticed of the two are the prefix

Re: MARC::File::XML 0.85

2007-04-16 Thread Ed Summers
I apologize, but I'm finding it hard to trace what exactly this script is doing. I did take a look at the first failure and sure enough the record leader says it's 463 bytes but the record itself is 464 bytes. So a failure is warranted -- given the current behavior of MARC::Record. Perhaps dumbin

Re: MARC::File::XML 0.85

2007-04-16 Thread Paul POULAIN
Joshua M. Ferraro a écrit : - "Mike Rylander" <[EMAIL PROTECTED]> wrote: CVS is updated with that now, and after anyone willing makes sure it's not breaking anything I think we should release again. Just updated and ran make test successfully. I'm happy to roll a release today if there are

Re: MARC::File::XML 0.85

2007-04-16 Thread Dan Scott
With MARC::Charset 0.96, MARC::Record 2.0.0, and MARC::File::XML from CVS, I get failed tests for 7 out of 8 of Joshua's tests. Test STDIN and STDOUT output attached. Dan Scott On 16/04/07, Joshua M. Ferraro <[EMAIL PROTECTED]> wrote: - "Mike Rylander" <[EMAIL PRO

Re: MARC::File::XML 0.85

2007-04-16 Thread Joshua M. Ferraro
- "Mike Rylander" <[EMAIL PROTECTED]> wrote: > CVS is updated with that now, and after anyone willing makes sure > it's not breaking anything I think we should release again. Just updated and ran make test successfully. I'm happy to roll a release today if there are no objections. This may be

Re: MARC::File::XML 0.85

2007-04-13 Thread Mike Rylander
On 4/13/07, Paul POULAIN <[EMAIL PROTECTED]> wrote: Mike Rylander a écrit : > On 4/13/07, Joshua M. Ferraro <[EMAIL PROTECTED]> wrote: > $parser->{ Handler }{ toMARC8 } = (lc($format) =~ /^unimarc/o || ( > $enc && lc($enc) =~ /^utf-?8$/o )) ? 0 : 1; > mmm... I agree with you. In fact, I think h

Re: MARC::File::XML 0.85

2007-04-13 Thread Paul POULAIN
Mike Rylander a écrit : On 4/13/07, Joshua M. Ferraro <[EMAIL PROTECTED]> wrote: $parser->{ Handler }{ toMARC8 } = (lc($format) =~ /^unimarc/o || ( $enc && lc($enc) =~ /^utf-?8$/o )) ? 0 : 1; mmm... I agree with you. In fact, I think having a test that, if true make 0 as result and if false

Re: MARC::File::XML 0.85

2007-04-13 Thread Mike Rylander
On 4/13/07, Joshua M. Ferraro <[EMAIL PROTECTED]> wrote: Hi folks, A new version of MARC::File::XML has been uploaded to CPAN. It's a one character fix that solves a bug for UNIMARC users who don't have MARC-8 encoded records. Thanks to Paul Poulain for pointing it out. Unf

MARC::File::XML 0.85

2007-04-13 Thread Joshua M. Ferraro
Hi folks, A new version of MARC::File::XML has been uploaded to CPAN. It's a one character fix that solves a bug for UNIMARC users who don't have MARC-8 encoded records. Thanks to Paul Poulain for pointing it out. Cheers, -- Joshua Ferraro SUPPORT FOR OPEN-SOURC

small (although important for UNIMARC guys) bug in MARC::File::XML ?

2006-10-11 Thread Paul POULAIN
As I have unimarc records, i'm not happy with that... so I have investigated MARC::File::XML and found at line 448 : > $parser->{ Handler }{ toMARC8 } = (lc($format) ne 'unimarc' && $enc && lc($enc) =~ /^utf-?8$/o) ? 0 : 1; meaning that for "unimarc

Re: MARC::* concern (Re: MARC::File::XML suggestion)

2006-07-14 Thread Thomas Dukleth
On Fri, July 14, 2006 2:17 pm, Edward Summers wrote: > This is all fine, but lets talk in unit tests for MARC::Record if we > can. They will make plain what the actual behavior is ... I was commenting here about my concern raised by the findings of Paul Poulain. I had no test of my own showing an

Re: MARC::* concern (Re: MARC::File::XML suggestion)

2006-07-14 Thread Edward Summers
codes containing non-numeric characters using MARC::Record or MARC::File::XML. However, I have not tested extensively and want to be certain that no problems would not arise from such usage in future. 2. CHARACTER VARIENCE FOR SUBFIELD CODES. On Thu, July 13, 2006 4:41 pm, Paul POULAIN wrote:

MARC::* concern (Re: MARC::File::XML suggestion)

2006-07-13 Thread Thomas Dukleth
MARC::Record or MARC::File::XML. However, I have not tested extensively and want to be certain that no problems would not arise from such usage in future. 2. CHARACTER VARIENCE FOR SUBFIELD CODES. On Thu, July 13, 2006 4:41 pm, Paul POULAIN wrote: > Could it be possible to have an option t

Re: MARC::File::XML suggestion

2006-07-13 Thread Edward Summers
On Jul 13, 2006, at 12:41 PM, Paul POULAIN wrote: sometimes, I parse XML that contains invalid subfieldcode (like a capital letter) M::F::X definetly dies in this case. MARC::Field seems to allow a subfield with a capital letter--as it should since there really is no requirement that subfie

MARC::File::XML suggestion

2006-07-13 Thread Paul POULAIN
Hello all, sometimes, I parse XML that contains invalid subfieldcode (like a capital letter) M::F::X definetly dies in this case. Could it be possible to have an option that deals with invalid subfieldcodes : * die (actual behaviour) * drop (drop the subfield) * ignore (integrate the subfiel

Re: More Fun With MARC::File::XML: Solutions

2006-05-28 Thread Edward Summers
On May 28, 2006, at 2:25 PM, Joshua Ferraro wrote: Maybe a note somewhere in the MARC::File::XML documentation to point these issues out would be useful. Also, it wouldn't be too bad to have a few tests to make sure that the system's default SAX parser is capable of handling these c

Re: More Fun With MARC::File::XML: Solutions

2006-05-28 Thread Joshua Ferraro
Hi everyone, Just providing an update on this issue. As you may recall, I've been putting the MARC::Record suite, specifically MARC::File::XML and MARC::Charset, through some fairly rigourous tests, including a 'roundtrip' test, which converts the binary MARC-8 records to MARCXML /

More Fun With MARC::File::XML

2006-05-23 Thread Joshua Ferraro
Hi everyone, Thanks for all the help thusfar ... Things are running much smoother since I installed XML::SAX::Expat. However, I'm still having a problem I haven't been able to work around. I have the following batch of 5 records: http://liblime.com/public/several.mrc They don't throw any errors

Re: [Koha-zebra] Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-21 Thread Pierrick LE GALL
On Mon, 20 Mar 2006 10:54:08 -0500 "Mike Rylander" <[EMAIL PROTECTED]> wrote: > Except that Perl doesn't know that the data is already UTF8 ... which > is the problem. [...] You're completely right, I understand the difference. We made UTF8 work from MySQL bu we didn't tried to work on data comin

Re: [Koha-zebra] Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Mike Rylander
all PerlIO channels. There really isn't any way around this if you want to claim real UTF8 support and be able to use components that really do support UTF8 natively, like MARC::File::XML and MARC::Record. It's unfortunate that the DBD::mysql people won't fix their module, but

Re: [Koha-zebra] Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Pierrick LE GALL
Hello Mike, I'll answer to the second question, since I worked with Paul on Perl/MySQL and UTF-8... On Mon, 20 Mar 2006 09:59:32 -0500 "Mike Rylander" <[EMAIL PROTECTED]> wrote: > Are you using decode_utf8($mysql_string) to let Perl know that the > database is UTF8 encoded? IIRC, MySQL doesn't

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Paul POULAIN
Mike Rylander a écrit : On 3/20/06, Paul POULAIN <[EMAIL PROTECTED]> wrote: Mike Rylander a écrit : I tested with the record you sent Ed and me, and everything seems to work for me ... As you can see, I tested several variants of the UNIMARC flag, and even tested not sending the encoding to n

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Mike Rylander
On 3/20/06, Paul POULAIN <[EMAIL PROTECTED]> wrote: > Mike Rylander a écrit : > > I tested with the record you sent Ed and me, and everything seems to > > work for me ... > > As you can see, I tested several variants of the UNIMARC flag, and > > even tested not sending the encoding to new_from_xml(

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Paul POULAIN
Mike Rylander a écrit : I tested with the record you sent Ed and me, and everything seems to work for me ... As you can see, I tested several variants of the UNIMARC flag, and even tested not sending the encoding to new_from_xml() ... it all seems to work for me, and I'm not sure what problems yo

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-17 Thread Mike Rylander
I tested with the record you sent Ed and me, and everything seems to work for me ... comparing the preprocessed XML with a copy that I round-tripped through MARC::Record and MARC::File::XML, they look the same. Here's my little test script (unimarc-te

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-17 Thread Paul POULAIN
nstall I updated MARC::record (2.000), MARC::Charset (0.920) and MARC::File::xml (0.810 ???) from sourceforge. I tested with a unimarc xml file (that I've send to you by pm), without change. what did I make wrong ? -- Paul POULAIN et Henri Damien LAURENT Consultants indépendants en logi

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Mike Rylander
Mea culpa ... read on. :) On 3/16/06, Mike Rylander <[EMAIL PROTECTED]> wrote: > I've updated the cvs for MARC::File::XML with what I described below, > with one caveat. The one difference from what I was planning is that, > because as_xml() is generated by MARC::Record,

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Ed Summers
On 3/16/06, Mike Rylander <[EMAIL PROTECTED]> wrote: > Will some brave soul please test this with some UNIMARC records and > let me know how it goes? Yes please, add the test to the test suite if possible Joshua and Paul. miker_++ //Ed

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Mike Rylander
I've updated the cvs for MARC::File::XML with what I described below, with one caveat. The one difference from what I was planning is that, because as_xml() is generated by MARC::Record, I can't give it new parameters. To test exporting to XML you'll need to set the record fo

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Mike Rylander
I've been attempting to beat the MARC::File::XML stuff into a usable shape as of late, so I'm going to take a stab at fixing this. There will be some limitations (at first) as to what encodings we'll accept for UNIMARC records, but I'll cover the cases that I know about (and

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Zeno Tajoli
rc_abrege.pdf, page 8 for 100$a) BIG PROBLEM : MARC::File::XML only checks for position 9, thinking the XML is necessary a marc21 file. I think (& joshua agrees) we will have to hack MARC::File::XML to solve this problem. We have 2 solutions : * add a test to define wether we are UNIMARC or M

Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Paul POULAIN
Hello all, Still working on UNICODE in Koha. We are stuck with a not-so-nice problem. (Many many thanks to the librarians that wrote marc21 and unimarc standards...) I explain : yesterday : joshua "the new marc::file::xml works fine with utf8 now". me : "Great ! i'll g

Re: MARC::File::XML => ed Rocks !

2006-01-10 Thread Ed Summers
On 1/10/06, Paul POULAIN <[EMAIL PROTECTED]> wrote: > Repeat after me : Ed Summers is the best Perl coder for librarians ! Oh shucks, thanks, I *really* wish that was true. The reality is that I wrote that module and was familiar with one of its limitations. The real people who should be praised

Re: MARC::File::XML perfs

2006-01-09 Thread Ed Summers
I should have mentioned that MARC::File::XML uses XML::SAX for XML parsing. XML::SAX can use a variety of backend XML parsers, but I believe by default it will use the XML::SAX::PurePerl parser if it can't find any other ones installed, which is exceptionally slow. I recommend taking a lo

Re: MARC::File::XML perfs

2006-01-09 Thread Ed Summers
On 1/9/06, Paul POULAIN <[EMAIL PROTECTED]> wrote: > what am I doing wrong ? Using Perl? :-) Seriously though, i'd be interested in DProf [1] output from your program if you have the energy. //Ed [1] http://search.cpan.org/~ilyaz/DProf-19990108/DProf.pm

MARC::File::XML perfs

2006-01-09 Thread Paul POULAIN
Hi, I begin using MARC::File::XML in Koha, to read XML records and transform them in MARC::Record. I tried some perfs and it seems they are really poor. something like : * reading a line in mySQL (1 DB line, indexed, so really fast, contains a MARCXML field) * create a MARC::Record with my

Re: MARC::File::XML

2006-01-06 Thread Paul POULAIN
Ed Summers a écrit : I'm curious what people would find to be the best default behavior for MARC::File::XML when it creates a MARC::Record object from XML. Should the character encoding by default by transposed from UTF-8 to MARC-8? Or should it be left as UTF-8? maybe a new_from_xml

MARC::File::XML

2006-01-06 Thread Ed Summers
I'm curious what people would find to be the best default behavior for MARC::File::XML when it creates a MARC::Record object from XML. Should the character encoding by default by transposed from UTF-8 to MARC-8? Or should it be left as UTF-8? //Ed

Updating MARC::File::XML (was Re: Character sets - kind of solved?)

2004-12-06 Thread Mike Rylander
at app's > character set was apparently Unicode (UTF-8) and so E5 and F2 were > replaced by U+FFFD. That's the long way of saying that the patch should > work fine in his case. :-) > I understand. It wasn't that I was trying to solve that particular problem, it just go

  1   2   >