Re: [XeTeX] Σχετ: Re: Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Philip Taylor
Apostolos Syropoulos wrote: > The only mark that remains when making all capitals is the dieredis > (dialytika). All other vanish. This is common knowledge for people who > speak and write Greek. Well, this is not the opinion of (for example) Dr Charalambos Dendrinos, a native Greek speaker and

Re: [XeTeX] Bug fixes and new features related to Unicode character codes, surrogates, etc

2015-05-07 Thread David Carlisle
On 7 May 2015 at 02:07, Ross Moore wrote: > Hi David, > > .. > > No disagreement to this. > > OK:-) > > In the current versions d835dc00 is two characters in luatex > and one character in xetex > as the implementation detail that xetex's underlying storage is mostly > UTF-16 is exp

[XeTeX] Σχετ: Re: Σχετ: Re: Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Apostolos Syropoulos
Well I do not know what Dendrinos says I just happen to know what people do in typography and in everyday practice. Also I happen to be a native Greek speakee... Στάλθηκε από το Ταχυδρομείο Yahoo στο Android Από:"Philip Taylor" Ληφθέντα:Πέμ, 7 Μαϊ, 2015 στις 11:34 Θέμα:Re: Σχετ: Re: [XeTeX] As

Re: [XeTeX] Σχετ: Re: Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Jonathan Kew
On 7/5/15 09:34, Philip Taylor wrote: Apostolos Syropoulos wrote: The only mark that remains when making all capitals is the dieredis (dialytika). All other vanish. This is common knowledge for people who speak and write Greek. Well, this is not the opinion of (for example) Dr Charalambos D

Re: [XeTeX] Σχετ: Re: Σχετ: Re: Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Julian Bradfield
On 2015-05-07, Apostolos Syropoulos wrote: > Well I do not know what Dendrinos says I just happen to know what people do in > typography and in everyday practice. Which is not what the Unicode uppercase mapping is for. The uppercase mapping in the data file gives a default mapping, which is appr

Re: [XeTeX] Σχετ: Re: Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Joseph Wright
On 07/05/2015 10:56, Jonathan Kew wrote: > On 7/5/15 09:34, Philip Taylor wrote: >> >> >> Apostolos Syropoulos wrote: >> >>> The only mark that remains when making all capitals is the dieredis >>> (dialytika). All other vanish. This is common knowledge for people who >>> speak and write Greek. >> >

[XeTeX] Case changing for Greek

2015-05-07 Thread Joseph Wright
Hello all, The question of case changing in Greek has come up in another thread. Whilst the details here aren't XeTeX (or even TeX) specific, given the interest by members of the list I hope I can take advantage to ask about the area. For work on LaTeX3/expl3 we've put together an approach to cas

Re: [XeTeX] Σχετ: Re: Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Philip Taylor
Jonathan Kew wrote: > I still maintain that the default code values assigned in formats > such as xe(la)tex should be based directly on the Unicode properties. It > would be great to have a Greek package that implements proper Greek > uppercasing, but this level of language- and orthography-spec

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Philip Taylor
Joseph Wright wrote: > For performance reasons that code has been set up to assume that a > sigma is final if it is followed by a space, a control sequence or a > character from the list > > ) ] } . : ; , ! ? ' " The inclusion of "a control sequence" worries me; may I ask why you do not pr

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Joseph Wright
On 07/05/2015 13:40, Philip Taylor wrote: > > > Joseph Wright wrote: > >> For performance reasons that code has been set up to assume that a >> sigma is final if it is followed by a space, a control sequence or a >> character from the list >> >> ) ] } . : ; , ! ? ' " > > The inclusion of "a

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Nikos Platis
2015-05-07 15:22 GMT+03:00 Joseph Wright : > For performance reasons that code has been set up to assume that a > sigma is final if it is followed by a space, a control sequence or a > character from the list > > ) ] } . : ; , ! ? ' " > > I would add to this list the dashes, "anoteleia", the g

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Jonathan Kew
On 7/5/15 13:22, Joseph Wright wrote: Hello all, The question of case changing in Greek has come up in another thread. Whilst the details here aren't XeTeX (or even TeX) specific, given the interest by members of the list I hope I can take advantage to ask about the area. For work on LaTeX3/exp

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Joseph Wright
On 07/05/2015 14:23, Nikos Platis wrote: > 2015-05-07 15:22 GMT+03:00 Joseph Wright : > >> For performance reasons that code has been set up to assume that a >> sigma is final if it is followed by a space, a control sequence or a >> character from the list >> >> ) ] } . : ; , ! ? ' " >> >> > I

Re: [XeTeX] Σχετ: Re: Σχετ: Re: Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Édouard Gilbert
> Le 7 mai 2015 à 11:57, Julian Bradfield a écrit : > > On 2015-05-07, Apostolos Syropoulos wrote: >> Well I do not know what Dendrinos says I just happen to know what people do >> in >> typography and in everyday practice. > > Which is not what the Unicode uppercase mapping is for. The uppe

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Joseph Wright
On 07/05/2015 14:26, Jonathan Kew wrote: > FWIW, we've done some work on this in Mozilla in the past few years, to > provide language-appropriate behavior for CSS features like > text-transform:uppercase and font-variant:small-caps. You might like to > review the discussion in bug reports such as >

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Jonathan Kew
On 7/5/15 13:22, Joseph Wright wrote: Included in that 'standard' set up is the final sigma rule for Greek text. For performance reasons that code has been set up to assume that a sigma is final if it is followed by a space, a control sequence or a character from the list ) ] } . : ; , ! ?

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Joseph Wright
On 07/05/2015 15:02, Jonathan Kew wrote: > On 7/5/15 13:22, Joseph Wright wrote: >> Included in that 'standard' set up is the final sigma rule for Greek >> text. For performance reasons that code has been set up to assume that a >> sigma is final if it is followed by a space, a control sequence or

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread BPJ
Den 2015-05-07 16:02, Jonathan Kew skrev: Would it be feasible to define this negatively instead -- something like "a sigma is final if it is NOT followed by another letter"? A possible refinement is that a lone sigma, neither preceded nor followed by another letter, should probably be lowercase

Re: [XeTeX] Case changing for Greek

2015-05-07 Thread Apostolos Syropoulos
>> A possible refinement is that a lone sigma, neither preceded nor >> followed by another letter, should probably be lowercased as σ rather >> than ς. > > One that needs input from a Greek speaker! > That is correct Jonathan. In fact the general rule is that a σ at the end of a word becomes alwa

Re: [XeTeX] Assignment of codes (particularly \catcode) based on Unicode data

2015-05-07 Thread Apostolos Syropoulos
> pieces of software, so is cross-compatible with other stuff. Third, as a > non-Greek I can't comment on the technical correctness of what you say! Obviously we are educated people here and I would not dare to say that you do not speak English... > Is there some place I could see this discusse