Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Edward Summers
On Dec 5, 2005, at 8:33 PM, Brad Baxter wrote: I think you're correct to be conservative. I've been spoiled by servers with lots of memory, so my judgement may be in question. :-) Wow, AnyDBM_File looks perfect. It'll use ndbm, then Berkeley DB, GDBM, and then fall back on SDBM. Like you sa

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Brad Baxter
On 12/5/05, Ed Summers <[EMAIL PROTECTED]> wrote: > > > Am I right that this amounts to less than 1Meg (EastAsian.db + > > UTF8.db)? Depending on your system and your needs (more > > speed?), that may not be considered large and might fit into > > memory fine. Otherwise, I think any of the in-core

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Brad Baxter
On 12/5/05, Ed Summers <[EMAIL PROTECTED]> wrote: > > On 12/5/05, Doran, Michael D <[EMAIL PROTECTED]> wrote: > > So... this is all very interesting (and I've definitely learned > > something here), but like I suggested previously, this level of digging > > may be a bit beyond the "casual" Perl

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Mike Rylander
heh ... edition, even. That is all. :) On 12/5/05, Mike Rylander <[EMAIL PROTECTED]> wrote: > On 12/5/05, Ed Summers <[EMAIL PROTECTED]> wrote: > > > Am I right that this amounts to less than 1Meg (EastAsian.db + > > > UTF8.db)? Depending on your system and your needs (more > > > speed?), that m

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Mike Rylander
On 12/5/05, Ed Summers <[EMAIL PROTECTED]> wrote: > > Am I right that this amounts to less than 1Meg (EastAsian.db + > > UTF8.db)? Depending on your system and your needs (more > > speed?), that may not be considered large and might fit into > > memory fine. Otherwise, I think any of the in-core (

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Ed Summers
> Am I right that this amounts to less than 1Meg (EastAsian.db + > UTF8.db)? Depending on your system and your needs (more > speed?), that may not be considered large and might fit into > memory fine. Otherwise, I think any of the in-core (non-DB_File) > DBM files ought to suffice for that amount

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Ed Summers
Ok, this is great information to have moving forward wi the next MARC::Charset...many thanks Michael and Jason. Micheal you are totally right the installer really shouldn't fail like that...I'd never tested it on a system that lacked DB_File so I didn't know. And CPAN testers didn't pick it up eith

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Brad Baxter
On 12/5/05, Doran, Michael D <[EMAIL PROTECTED]> wrote: > > > For those who aren't familiar with the MARC 21 alternate character set > repertoires (specifically, the East Asian ideographs), by "largish", Ed > is talking on the order of a table containing upwards of 16,000 > mappings. > > Am I right

RE: MARC-8 to UTF-8 conversion

2005-12-05 Thread Thomale, J
> I'm probably starting to sound nit-picky, but please > understand that it's only because I think MARC::Charset is a > great module and I'd like for more people to be using it. :-) Let me second Michael's statement. A couple of months ago we tried installing MARC::Charset and ran into exactly

RE: MARC-8 to UTF-8 conversion

2005-12-05 Thread Doran, Michael D
> If anyone has any suggestions on how to handle a > largish character mapping table [...] For those who aren't familiar with the MARC 21 alternate character set repertoires (specifically, the East Asian ideographs), by "largish", Ed is talking on the order of a table containing upwards of 16,000

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Ed Summers
On 12/5/05, Doran, Michael D <[EMAIL PROTECTED]> wrote: > So... this is all very interesting (and I've definitely learned > something here), but like I suggested previously, this level of digging > may be a bit beyond the "casual" Perl user. ;-) Yep, point taken. I'm guessing you are right: w

RE: MARC-8 to UTF-8 conversion

2005-12-05 Thread Doran, Michael D
Ed, ED > I don't really understand why Perl 5.8.7 lacked DB_File since ED > Module::CoreList [...] reports it being standard sine 5.00307. Perhaps ED > this is some sort of emasculated version that ships with Solaris :-) Nope, I wasn't using a "Perl lite" version. ;-) Although Solaris now comes

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Ed Summers
I don't really understand why Perl 5.8.7 lacked DB_File since Module::CoreList [1] reports it being standard sine 5.00307. Perhaps this is some sort of emasculated version that ships with Solaris :-) [EMAIL PROTECTED] ed]$ corelist DB_File DB_File was first released with perl 5.00307 It loo

RE: MARC-8 to UTF-8 conversion

2005-12-05 Thread Doran, Michael D
Hi Ed, > -Original Message- > From: Edward Summers [mailto:[EMAIL PROTECTED] > Sent: Monday, December 05, 2005 6:14 AM > To: perl4lib > Subject: Re: MARC-8 to UTF-8 conversion > > On Dec 2, 2005, at 9:01 AM, Doran, Michael D wrote: > > > Installing the MARC::Charset module can be a bit

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Edward Summers
On Dec 2, 2005, at 9:01 AM, Doran, Michael D wrote: Installing the MARC::Charset module can be a bit problematic for the casual Perl user, due to the prerequisites. Is DB_File a big deal as a prerequisite? it's been in Perl since 5.00307. The other prereq is perl 5.8, but doing unicode wor

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread Edward Summers
On Dec 5, 2005, at 2:40 AM, bargioni wrote: If you have some tricks, please let me know. Apart from following the examples in the documentation I don't know of any tricks :-) Seriously though I'm about to release a new MARC::Charset that uses the LoC XML mapping table, and a simplified in

Re: MARC-8 to UTF-8 conversion

2005-12-05 Thread bargioni
Dear Doran, Ed, Bill and others: thank you for your replies. I installed MARC::Charset using the CPAN module to ensure dependencies. I don't know why it is not working well. If you have some tricks, please let me know. Although I'm interested in MARC-8 -> UTF-8 conversion in memory, a good way