Thanks, Simos for your kind and time. Much appreciated to Javier for brought a good solution indeed.
Lao input method is need a similar solution. Javier please post your solution (where and how to define a new table for Khmer) so I can define these code points for Lao. Thanks, Anousak The Lao team On Tue, Jun 10, 2008 at 1:58 AM, Simos Xenitellis <[EMAIL PROTECTED]> wrote: > O/H Javier SOLA έγραψε: >> >> Thanks Simos !! >> >> Actually, we have had these additions for a while in X11. > > Hi Javier, > > Checking at > http://gitweb.freedesktop.org/?p=xorg/lib/libX11.git;a=tree;f=nls/en_US.UTF-8 > does not show these lines at the end. It is possible that these compose > sequences were added as a patch to the distribution package. >> >> We will do an issue for GTK+, and use the variable meanwhile. >> >> What file is it in GTK+? I have not been able to find it. > > In GTK+ (HEAD), the relevant file is > http://svn.gnome.org/viewvc/gtk%2B/trunk/gtk/gtkimcontextsimple.c?view=markup > > However, your case of compose sequences is different from the existing > compose sequences, that result to a single codepoint (you require to produce > two codepoints). > > Therefore, the type of support you are looking for is similar to compose > sequences that result to letter+diacritic mark. Several languages have > characters that no pre-composed letters exist, so the compose sequence > produces letter+diacritic marks (more than one codepoint). Such support is > missing, and there are already bug reports for them. > > Bug 341341 – Compose mechanism in simple input method doesn't support > decomposed forms > http://bugzilla.gnome.org/show_bug.cgi?id=341341 > > Bug 345254 – dead accents should at least produce combining characters > http://bugzilla.gnome.org/show_bug.cgi?id=345254 > > There is a shortcut when trying to solve the above cases of compose > sequences, thus the solution I expect to be different from the Khmer compose > sequences. > Specifically, for the Latin compose sequences, such as (it's a made up > example) > > <dead_acute> <t> : "t́" # LETTER T WITH ACUTE > > one could convert to something like [ dead_acute, 't', 0]. > We would put 0 for the resulting codepoint because we can deduce for this > category of compose sequences that the actual codepoints are 't' and 'acute' > (the resulting codepoints match the body of the compose sequence). > > However, for the case of Khmer, the compose sequences look independent from > the resulting code points. Therefore, a new table should be required. > > To cut the story short, I have filed a bug report for this, > Bug 537457 – Support compose sequences that produce two+ codepoints > http://bugzilla.gnome.org/show_bug.cgi?id=537457 > > Simos > >> >> Thanks, >> >> Javier >> >> Simos Xenitellis wrote >>> >>> O/H Javier SOLA έγραψε: >>>> >>>> Hi, >>>> >>>> I am working on Khmer localization (KhmerOS project). >>>> >>>> In Khmer, some of the basic vowels (which we include in the keyboard) >>>> require two code-points, so one keystroke must generate two code points. >>>> >>>> It used to be that we could do the conversion in KBX by generating a >>>> fictious code-point (Pablo Saratxaga explained this to us a few years ago), >>>> which was later translated to two real code-points by puting the conversion >>>> in the en-US locale file. I did work at the time. >>>> >>>> But now this seems to have stopped working. Does anybody knows how we >>>> can fix this? >>> >>> These additions (pressing a single key and producing two codepoints), are >>> located at >>> /usr/share/X11/locale/en_US.UTF-8/Compose >>> The specific lines appear to be >>> >>> # Khmer digraphs >>> # A keystroke has to generate several characters, so they are defined >>> # in this file >>> >>> <U17fb> : "ុះ" >>> <U17fc> : "ុំ" >>> <U17fd> : "េះ" >>> <U17fe> : "ោះ" >>> <U17ff> : "ាំ" >>> >>> GTK+ based applications duplicate the Compose file in the gtk+ library, >>> and currently the version of the Compose file that exists in gtk+ does not >>> include those specific compose sequences. >>> I think these are a recent addition. >>> Technically, it is possible for gtk+ to include compose sequences that >>> produce more than one code points (requires small change in the code), >>> however these recent Khmer digraphs are the only compose sequences using the >>> facility now. >>> >>> To cut the long story short, you can bypass for now the GTK+ version of >>> the Compose file and use the Compose file that comes with X.Org (shown >>> above) by setting the environment variable GTK_IM_MODULE to "xim". >>> This should not have adverse effect to the OLPC software. >>> >>> It is important that if other keyboard layouts as well require compose >>> sequences that produce >>> two or more codepoints (such as Serbian), to add them to the XOrg Compose >>> file. In the next iteration of update of the GTK+, all these compose >>> sequences can make it in. >>> >>> Simos >>> >>> >> >> > > _______________________________________________ > gnome-i18n mailing list > [EMAIL PROTECTED] > http://mail.gnome.org/mailman/listinfo/gnome-i18n > -- Anousak (Anthony) Souphavanh "Small can make a big impact" _______________________________________________ gtk-i18n-list mailing list gtk-i18n-list@gnome.org http://mail.gnome.org/mailman/listinfo/gtk-i18n-list