Re: [Koha-zebra] Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-21 Thread Pierrick LE GALL
On Mon, 20 Mar 2006 10:54:08 -0500 "Mike Rylander" <[EMAIL PROTECTED]> wrote: > Except that Perl doesn't know that the data is already UTF8 ... which > is the problem. [...] You're completely right, I understand the difference. We made UTF8 work from MySQL bu we didn't tried to work on data comin

Re: [Koha-zebra] Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Mike Rylander
On 3/20/06, Pierrick LE GALL <[EMAIL PROTECTED]> wrote: > Hello Mike, > > I'll answer to the second question, since I worked with Paul on > Perl/MySQL and UTF-8... > > On Mon, 20 Mar 2006 09:59:32 -0500 > "Mike Rylander" <[EMAIL PROTECTED]> wrote: > > > Are you using decode_utf8($mysql_string) to l

Re: [Koha-zebra] Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Pierrick LE GALL
Hello Mike, I'll answer to the second question, since I worked with Paul on Perl/MySQL and UTF-8... On Mon, 20 Mar 2006 09:59:32 -0500 "Mike Rylander" <[EMAIL PROTECTED]> wrote: > Are you using decode_utf8($mysql_string) to let Perl know that the > database is UTF8 encoded? IIRC, MySQL doesn't

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Paul POULAIN
Mike Rylander a écrit : On 3/20/06, Paul POULAIN <[EMAIL PROTECTED]> wrote: Mike Rylander a écrit : I tested with the record you sent Ed and me, and everything seems to work for me ... As you can see, I tested several variants of the UNIMARC flag, and even tested not sending the encoding to n

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Mike Rylander
On 3/20/06, Paul POULAIN <[EMAIL PROTECTED]> wrote: > Mike Rylander a écrit : > > I tested with the record you sent Ed and me, and everything seems to > > work for me ... > > As you can see, I tested several variants of the UNIMARC flag, and > > even tested not sending the encoding to new_from_xml(

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-20 Thread Paul POULAIN
Mike Rylander a écrit : I tested with the record you sent Ed and me, and everything seems to work for me ... As you can see, I tested several variants of the UNIMARC flag, and even tested not sending the encoding to new_from_xml() ... it all seems to work for me, and I'm not sure what problems yo

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-17 Thread Mike Rylander
I tested with the record you sent Ed and me, and everything seems to work for me ... comparing the preprocessed XML with a copy that I round-tripped through MARC::Record and MARC::File::XML, they look the same. Here's my little test script (unimarc-test.pl): --

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-17 Thread Paul POULAIN
Mike Rylander a écrit : CVS checkout intsructions cvs -d:pserver:[EMAIL PROTECTED]:/cvsroot/marcpm login cvs -z3 -d:pserver:[EMAIL PROTECTED]:/cvsroot/marcpm co -P marc-xml Then, cd marc-xml perl Makefile.PL make make test And assuming 'make test' succeeds ... make install I upd

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Mike Rylander
Mea culpa ... read on. :) On 3/16/06, Mike Rylander <[EMAIL PROTECTED]> wrote: > I've updated the cvs for MARC::File::XML with what I described below, > with one caveat. The one difference from what I was planning is that, > because as_xml() is generated by MARC::Record, I can't give it new > par

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Ed Summers
On 3/16/06, Mike Rylander <[EMAIL PROTECTED]> wrote: > Will some brave soul please test this with some UNIMARC records and > let me know how it goes? Yes please, add the test to the test suite if possible Joshua and Paul. miker_++ //Ed

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Mike Rylander
I've updated the cvs for MARC::File::XML with what I described below, with one caveat. The one difference from what I was planning is that, because as_xml() is generated by MARC::Record, I can't give it new parameters. To test exporting to XML you'll need to set the record format for export eithe

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Mike Rylander
I've been attempting to beat the MARC::File::XML stuff into a usable shape as of late, so I'm going to take a stab at fixing this. There will be some limitations (at first) as to what encodings we'll accept for UNIMARC records, but I'll cover the cases that I know about (and understand). Here's t

Re: Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Zeno Tajoli
Hi, PROBLEM : * in MARC21, the encoding is defined by position 9 of the leader. 'a' means UTF-8 * in UNIMARC, this is an empty position ! the encoding is in positions 26-27 and 28-29 of 100$a (<200 are all fixed coded fields in unimarc : http://bibliotheque.bgp-fr.com/Unimarc_abrege.pdf, page

Unimarc, marc21, Unicode, and MARC::File::XML

2006-03-16 Thread Paul POULAIN
Hello all, Still working on UNICODE in Koha. We are stuck with a not-so-nice problem. (Many many thanks to the librarians that wrote marc21 and unimarc standards...) I explain : yesterday : joshua "the new marc::file::xml works fine with utf8 now". me : "Great ! i'll give it a try" today :