Re: grapheme cluster library

2017-10-23 Thread Thomas Jollans
On 23/10/17 16:25, Rustom Mody wrote: > On Monday, October 23, 2017 at 1:15:35 PM UTC+5:30, Steve D'Aprano wrote: >> >> and more. Many linguists also include digraphs (pairs of letters) like the >> English "th", "sh", "qu", or "gh" as graphemes. >> >> >> https://www.thoughtco.com/what-is-a-grapheme

Re: grapheme cluster library

2017-10-23 Thread Rustom Mody
On Monday, October 23, 2017 at 1:15:35 PM UTC+5:30, Steve D'Aprano wrote: > On Mon, 23 Oct 2017 05:47 pm, Rustom Mody wrote: > > > On Monday, October 23, 2017 at 8:06:03 AM UTC+5:30, Lawrence D’Oliveiro > > wrote: > [...] > >> Bear in mind that the logical representation of the text is as code poi

Re: grapheme cluster library

2017-10-23 Thread Steve D'Aprano
On Mon, 23 Oct 2017 05:47 pm, Rustom Mody wrote: > On Monday, October 23, 2017 at 8:06:03 AM UTC+5:30, Lawrence D’Oliveiro > wrote: [...] >> Bear in mind that the logical representation of the text is as code points, >> graphemes would have more to do with rendering. > > Heh! Speak of Euro/Anglo-

Re: grapheme cluster library (Posting On Python-List Prohibited)

2017-10-22 Thread Rustom Mody
On Monday, October 23, 2017 at 8:06:03 AM UTC+5:30, Lawrence D’Oliveiro wrote: > On Saturday, October 21, 2017 at 5:11:13 PM UTC+13, Rustom Mody wrote: > > Is there a recommended library for manipulating grapheme clusters? > > Is this any goo

Re: grapheme cluster library

2017-10-21 Thread Steven D'Aprano
On Fri, 20 Oct 2017 21:11:02 -0700, Rustom Mody wrote: > Is there a recommended library for manipulating grapheme clusters? Back in July, I asked for anyone interested in grapheme clusters to consider checking out this issue on the bug tracker: http://bugs.python.org/issue30717 My post receive

Re: grapheme cluster library

2017-10-21 Thread Rustom Mody
On Saturday, October 21, 2017 at 9:22:24 PM UTC+5:30, MRAB wrote: > On 2017-10-21 05:11, Rustom Mody wrote: > > Is there a recommended library for manipulating grapheme clusters? > > > > In particular, in devanagari > > क् + ि = कि > > in (pseudo)unicode names > > KA-letter + I-sign = KI-composite

Re: grapheme cluster library

2017-10-21 Thread MRAB
On 2017-10-21 05:11, Rustom Mody wrote: Is there a recommended library for manipulating grapheme clusters? In particular, in devanagari क् + ि = कि in (pseudo)unicode names KA-letter + I-sign = KI-composite-letter I would like to be able to handle KI as a letter rather than two code-points. Can

Re: grapheme cluster library

2017-10-21 Thread Rustom Mody
On Saturday, October 21, 2017 at 11:51:57 AM UTC+5:30, Chris Angelico wrote: > On Sat, Oct 21, 2017 at 3:25 PM, Stefan Ram wrote: > > Rustom Mody writes: > >>Is there a recommended library for manipulating grapheme clusters? > > > > The Python Library has a module "unicodedata", with functions l

Re: grapheme cluster library

2017-10-20 Thread Chris Angelico
On Sat, Oct 21, 2017 at 3:25 PM, Stefan Ram wrote: > Rustom Mody writes: >>Is there a recommended library for manipulating grapheme clusters? > > The Python Library has a module "unicodedata", with functions like: > > |unicodedata.normalize( form, unistr ) > | > |Returns the normal form »form«

grapheme cluster library

2017-10-20 Thread Rustom Mody
Is there a recommended library for manipulating grapheme clusters? In particular, in devanagari क् + ि = कि in (pseudo)unicode names KA-letter + I-sign = KI-composite-letter I would like to be able to handle KI as a letter rather than two code-points. Can of course write an automaton to group bu