Hi,
I have some questions with UTF-8 & zebra (& MARC::Record & MARC::XML,
but I'm not sure to know which tool is responsible of my problem)
In my zebra config file I have :
recordType: grs.xml
encoding utf-8
MARC::Record is the sourceforge 2.0 version (installed today)
zebra version is 1.3.32
MARC::XML version is 0.7
YAZ version: 2.1.12
The following code (some line removed) :
$Zconn->option(cqlfile =>
C4::Context->config("intranetdir")."/zebra/pqf.properties");
$Zconn->option(preferredRecordSyntax => "xml");
my $rs = $Zconn->search($q);
for (my $i=$offset-1; $i <= $maxrecordnum-1; $i++) {
my $record = MARC::Record->new_from_xml($rs->record($i)->raw());
warn "REC2 = ".$record->as_formatted;
> }
shows in log :
Dictionnaire fran\xc3\xa7ais-anglais des termes relatifs \xc3\xa0
l'\xc3\xa9lectronique, l'\xc3\xa9lectrotechnique,
\xc3\xa7 is a ç, it should be 00E7,
\xc3\xa0 is a à, it should be 00E0,
\xc3\xa9 is a é, it should be 00E9.
(or i'm wrong somewhere, I must admit i'm a newbie at utf-8, you'll let
me know)
(If I directly dump the XML record returned by zebra, I get the same
result, so the problem is probably not in MARC::Record or MARC::XML)
Could someone help me finding the origin of the problem ?
--
Paul POULAIN et Henri Damien LAURENT
Consultants indépendants
en logiciels libres et bibliothéconomie (http://www.koha-fr.org)