Re: Advice on how to approach character translation

2008-04-26 Thread Dr.Ruud
J. Peng schreef: > Dr.Ruud: >> J. Peng: >>> Dr.Ruud: Jenda Krynicky: >@signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; (there was a comma missing) which you could even write as @signs = map quotemeta, @signs; >>> >>> or: >>>

Re: Advice on how to approach character translation

2008-04-26 Thread J. Peng
On Sat, Apr 26, 2008 at 7:54 PM, Dr.Ruud <[EMAIL PROTECTED]> wrote: > J. Peng schreef: > > Dr.Ruud: > >> Jenda Krynicky: > > > >>>@signs = map quotemeta($_) @signs; > >> > >> @signs = map quotemeta($_), @signs; > >> (there was a comma missing) > >> which you could even write as > >

Re: Advice on how to approach character translation

2008-04-26 Thread Dr.Ruud
J. Peng schreef: > Dr.Ruud: >> Jenda Krynicky: >>>@signs = map quotemeta($_) @signs; >> >> @signs = map quotemeta($_), @signs; >> (there was a comma missing) >> which you could even write as >> @signs = map quotemeta, @signs; > > or: > @signs = map { quotemeta } @signs; That i

Re: Advice on how to approach character translation

2008-04-26 Thread J. Peng
On Sat, Apr 26, 2008 at 5:41 PM, Dr.Ruud <[EMAIL PROTECTED]> wrote: > "Jenda Krynicky" schreef: > > > > @signs = map quotemeta($_) @signs; > > @signs = map quotemeta($_), @signs; > > (there was a comma missing) > > which you could even write as > > @signs = map quotemeta, @signs; >

Re: Advice on how to approach character translation

2008-04-26 Thread Dr.Ruud
"Jenda Krynicky" schreef: > @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; (there was a comma missing) which you could even write as @signs = map quotemeta, @signs; -- Affijn, Ruud "Gewoon is een tijger." sub uniq { my $prev; map

Re: Advice on how to approach character translation

2008-04-25 Thread R (Chandra) Chandrasekhar
Jenda Krynicky wrote: ... and build a regexp to match the 1-3 characters to replace: @signs = sort {length($b) <=> length($a)} keys %trans; Thanks for this priceless construct. It was very helpful indeed. @signs = map quotemeta($_) @signs; @signs = map quotemeta($_), @signs; # needed a c

Re: Advice on how to approach character translation

2008-04-24 Thread Chas. Owens
On Thu, Apr 24, 2008 at 11:40 AM, R (Chandra) Chandrasekhar <[EMAIL PROTECTED]> wrote: > Chas. Owens wrote: > > > > The easiest way I can think of is to build a (UTF-8) file named > > itrans2unicode.table that looks like this > > > > a => a > > aa => ā > > ~N => ṅ > > > > > > I have successfully

Re: Advice on how to approach character translation

2008-04-24 Thread R (Chandra) Chandrasekhar
Chas. Owens wrote: The easiest way I can think of is to build a (UTF-8) file named itrans2unicode.table that looks like this a => a aa => ā ~N => ṅ I have successfully created the file lookup.table containing lines as suggested above with ASCII and Unicode characters separated by ' => '.

Re: Advice on how to approach character translation

2008-04-23 Thread Jenda Krynicky
From: "R (Chandra) Chandrasekhar" <[EMAIL PROTECTED]> > 3. Some transliteration examples are shown below: > > a a U+0061 LATIN SMALL LETTER A > aa a U+0101 LATIN SMALL LETTER A WITH MACRON > A a U+0101 LATIN SMALL LETTER A WITH MACRON > .a ' U+0027 APOSTROPHE >

Re: Advice on how to approach character translation

2008-04-23 Thread Chas. Owens
On Wed, Apr 23, 2008 at 5:34 AM, R (Chandra) Chandrasekhar <[EMAIL PROTECTED]> wrote: > Dear Folks, > > A scheme called ITRANS uses the ASCII printing character set and between > one and three printing characters to unambiguously represent characters in > Indic scripts or a Romanized script calle

Advice on how to approach character translation

2008-04-23 Thread R (Chandra) Chandrasekhar
Dear Folks, A scheme called ITRANS uses the ASCII printing character set and between one and three printing characters to unambiguously represent characters in Indic scripts or a Romanized script called IAST. Since characters in these scripts have Unicode code points, it should be possible to