Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-22 Thread Andy Lin
I'd forgotten about the state of Japanese romanization. I'm just so used to seeing some form of mangled Hepburn because people don't bother typing macrons. But this type of fragmentation is mostly acceptable (that is, people can still understand each other despite the variation) and doesn't necessa

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-21 Thread Arthur Reutenauer
> tsu-kattari, > tsuka-ttari or tsukatta-ri ; second hyphenation preferable to > tsukat-tari because of how the word is segmented when spoken. Is there any established tradition on this? Hyphenation often doesn't respect syllable boundaries, a

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-20 Thread Mike "Pomax" Kamermans
On 6/19/2011 9:59 PM, Andy Lin wrote: Do you have examples of this outside of proper names? I haven't come across such a case and it'd be interesting to see. But wouldn't such variation defeat the purpose of romanization? I mean, consider the confusion that the Taiwanese government has caused b

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-19 Thread Andy Lin
On Sat, Jun 18, 2011 at 14:09, Gerrit wrote: > I thought of romanization which is specific to the language the surrounding > text is written in - e.g. French or German. And this is not so much the case > for Chinese or Japanese (except for some words: Beijing in English but > Peking in German). In

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-18 Thread Gerrit
Am 17.06.2011 19:29, schrieb Andy Lin: You have to be careful with Chinese. In Mandarin, you have Pinyin, but you also have several conflicting romanization schemes in use in Taiwan and older literature. For Cantonese, Hokkien, etc, syllable boundaries are not quite as easy to determine because t

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-17 Thread Andy Lin
On Thu, Jun 16, 2011 at 06:38, Gerrit wrote: > For Japanese and Chinese the advantage is that you have a universal > romanization, You have to be careful with Chinese. In Mandarin, you have Pinyin, but you also have several conflicting romanization schemes in use in Taiwan and older literature. F

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-17 Thread Pander
On 2011-06-17 16:17, Pander wrote: > On 2011-06-17 16:11, Arthur Reutenauer wrote: >>> Pinyin patterns already exist in our repository, so one should be able >>> to use them out-of-the-box, while for the rest we don't have anything >>> yet. >> >> I have patterns that work for Japanese, I'll send

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-17 Thread Pander
On 2011-06-17 16:11, Arthur Reutenauer wrote: >> Pinyin patterns already exist in our repository, so one should be able >> to use them out-of-the-box, while for the rest we don't have anything >> yet. > > I have patterns that work for Japanese, I'll send an e-mail about it > later; first I have

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-17 Thread Arthur Reutenauer
> Pinyin patterns already exist in our repository, so one should be able > to use them out-of-the-box, while for the rest we don't have anything > yet. I have patterns that work for Japanese, I'll send an e-mail about it later; first I have something else to finish ;-) Arthur

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-17 Thread Mojca Miklavec
2011/6/16 Ulrike Fischer wrote: > Am Thu, 16 Jun 2011 12:38:57 +0200 schrieb Gerrit: > > >> yes, I thought exactly of such a few words in a western text. >> >> For example, in situations like this: “A town where many hot springs >> (/onsen/) are located is Beppu in Kyūshū. ” >> Here, you have three

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-16 Thread Ulrike Fischer
Am Thu, 16 Jun 2011 12:38:57 +0200 schrieb Gerrit: > yes, I thought exactly of such a few words in a western text. > > For example, in situations like this: “A town where many hot springs > (/onsen/) are located is Beppu in Kyūshū. ” > Here, you have three Japanese words in a text: onsen, Beppu

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-16 Thread Gerrit
Am 16.06.2011 01:41, schrieb msk...@ansuz.sooke.bc.ca: I thought the original poster was talking about segments of text written in romanized Japanese as the only script - not phonetic guide texts (furigana) attached to Japanese script, nor equivalents in other languages. The issues you describe

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-15 Thread mskala
On Wed, 15 Jun 2011, Mike "Pomax" Kamermans wrote: > Since phonetic guide texts for CJKV are tied to characters, I would consider > the most logical split one where the guide text is dictated by the character > boundaries, and the language used. Hyphenation for guide text would be > strongly tied t

Re: [XeTeX] Polyglossia: Support for romanization of CJK

2011-06-15 Thread Mike "Pomax" Kamermans
On 6/15/2011 11:44 AM, Gerrit wrote: Hello again, everyone, I am currently writing an article, in which I also have some romanization of Japanese. Until now, I have to define the hyphenation manually, which I think is a little bit of a nuisance. [snip] What do you think about that? Since

[XeTeX] Polyglossia: Support for romanization of CJK

2011-06-15 Thread Gerrit
Hello again, everyone, I am currently writing an article, in which I also have some romanization of Japanese. Until now, I have to define the hyphenation manually, which I think is a little bit of a nuisance. So I wonder if it is possible to include at least hyphenation for Japanese, Chinese