Well, this has become a fairly long thread. I'll just note that I
don't have strong feelings about this specific proposal.

E.g. if the language tags were obvious and self-explanatory in the
text, like the insets that can be removed just by pressing backspace,
this would overcome some of my dislike for the automatic addition of
language tags.

On Sat, May 16, 2009 at 3:23 PM, Jürgen Spitzmüller <sp...@lyx.org> wrote:
> John McCabe-Dansted wrote:
>> I also consider it strange that the language can change within a word.
>
> Define "word" (for languages such as, say, English, German and Japanese).

In English anything separated by a space, punctuation etc. Finding
word breaks in Japanese is a bit harder.

It probably isn't actually worth doing anything about this oddness.

>> In my case \selectlanguage{american} etc. has popped up up a number of
>> times, and in all cases the LaTeX I want is exactly the LaTeX that
>> results when I remove \selectlanguage{[^}]*}.
>>
>> > However, if I copy a French sentence into an English document, I see no
>> > reason whatsoever to make the sentence English. French is French and not
>> > English, after all. Resetting the language would be a bug, IMHO.
>>
>> In my case my documents are pure English, but I might:
>> 1) Copy an acronym (or character) from a French document (unlikely)
>
> Usually, Acronyms are language-specific as well (cf. IPA [International
> Phonetic Association] vs. API [Association Phonétique Internationale]).

This still wouldn't help me, because I don't have any French language
documents. I'd be pasting from a French language website, if I was
very lucky I might be pasting from an English language document where
I had explicitly set that text to French; in which case copying over
the tags might make sense.

Point being that for a substantial class of users, the LaTeX model
where copying text only copies tags in that text is more useful for
language except in a few very rare cases. As argued elsewhere it may
also be useful for copying e.g. a word from a section title as well.

>> 2) Copy text from English US to English UK -- IMHO An English UK word
>> in an English US document is actually just a poorly spelt US English
>> word. Actually it is likely to even be a correctly spelt English US
>> word.
>
> Why? IMHO it is correct to keep it marked es English (UK), The author hgas to
> decide if he leaves it in UK spelling or if he adapts the spelling to US
> convention (and reset the language).

The spellchecker will pickup the misspellings, unless LyX keeps it
marked as English (UK).

Admittedly, Leaving it as UK might be useful if it was a direct quote,
and had proper quotation marks around it. (And yes I realize that
detecting quotations in every possible language could be difficult,
even for languages that actually have quotation marks. However I use
citations rather than quotes, so again this isn't something that makes
a difference)

>> In principle I may be submitting a document to an organization that
>> requires that all text be in Language X (and only language X), in
>> which case any LyX document I submit that contains language markup is
>> wrong, just as if I had included a Chapter in an article.
>
> But not if you use different languages (if this organization is not completely
> crazy, that is).

If I use a Chapter is a journal article, then it clearly should be
formatted as a Chapter, rather than e.g. standard?

If I did accidentally insert a Chapter into a article, it would at
least be a lot easier to find in proof-reading than multiple
"languages"
in a document that is actually monolingual. (Being able to add
Chapters into e.g. amsart it could actually be useful, often I've
wanted to do "something like style X, but with a feature from Y")

Even when using a foreign word it is common to convert it to a local
word. Japanese would commonly write English words in Katakana, and I
don't use Mitsubishi as a Japanese word, and I certainly don't write
it as 三菱 in an English document.

>> More likely, the receiving institution really wouldn't care whether I
>> use British or American English, so long as I am consistent. So in
>> this case hard-coding either British or American English would be
>> fine, but allowing both is again in some sense incorrect.
>
> But then, again, hyphenation might be just wrong.

Well I am not advocating hardcoding, never-the-less it would
presumably correct according to one of the two acceptable definitions
of correct. (at least if you spell words according to the hard-coded
language).

>> I note that LyX automatically removes \Chapter when pasting to an
>> article.  IMHO it also makes perfect sense to also remove
>> "\selectlanguage{british}" "\selectlanguage{american}" etc. when
>> pasting to a monolingual document. Perhaps all documents could start
>> in a "defacto monolingual" state, and when the first \selectlanguage
>> would be inserted the user could be asked whether they want a
>> "monolingual" document or "multilingual" document. If we were to get
>> fancy we could have documents where german/british are allowed but not
>> american.
>
> This is a very bad idea that will mixup any multilingual document. And, please

Clearly a setting a multilingual document to "monolingual" isn't going to help.

> excuse me, it strikes me rather monolingual-English centric, while LyX aims at
> being truly multilingual.

Presumably, not all monolingual users speak English; I would imagine
that French users may have similar issues with regard to French/French
Canadian.

In any case I don't see why this is an objection to this feature (I
can certainly see why you wouldn't want to work on it yourself).

I don't see any reason for LyX to not have features that only help
monolingual users, or rather users that only prepare monolingual
documents in LyX. I've studied other languages, and I could write a a
truly multilingual document, but then my readers would hate me. Heck,
I've had comments from reviewers that the "a" and the "e" in the
grapheme \ae{} look mushed together.

The main advantage that the lack this feature has for such users is
that the aggravation it causes them, might induce them to better
understand the difference between UK and US English. However if you
fix all the bugs wrt. ERT etc., and  then most of them will probably
think the blue overline is just form of screen corruption (it doesn't
appear in the PDF after all). If nothing else, the
monomodal/mulitiligual dialog might clue them into what the blue line
actually means.

Also if this dialog popups up immediately when you attempt to paste
the text in, it would be obvious to you which source document had the
wrong (sub) language.

-- 
John C. McCabe-Dansted
PhD Student
University of Western Australia

Reply via email to