Before we have any more of this discussion, let me just emphasize that my original concern was with the use of the Unicode emdash in the User's Guide and Tutorial, where it gives bad results, according to the standards of English typography. That is what I was proposing to change. The issue about unicodesymbols was raised by someone else and is quite orthogonal. Nonetheless, we can discuss it. There is another issue about how these things are rendered on-screen, and yet another about output, again all orthogonal. But they too can be discussed.


On 01/14/2010 05:06 PM, Guenter Milde wrote:
On 2010-01-14, rgheck wrote:
On 01/14/2010 11:39 AM, Jean-Marc Lasgouttes wrote:
rgheck<rgh...@bobjweil.com>   writes:
The input ligature can have unwanted side-effects.

Copy the following into a LyX window:

    To get help, use
    # ls --help

Look at the PDF. I copied the text back from xpdf and got:

Of course you do, because it is an endash. But if you put it in listings, or in a LyXCode environment, you get what you would expect to get in those cases. In the LyXCode case, that's because we find this:

ls~-{}-help

in the source. So LyX just handles this for you. I don't see anything confusing about it.

As Jurgen said, do you have an example?
Now, change the command to typewriter font and try again. This time it
helps, as typewriter is one example of a font without this input
ligature. But you also don't get an em-dash even if you mean one.

This makes perfect sense: Typewriters don't have em-dash keys. So the em-dash is rendered as "---". Which is pretty much how people used to type it.

Why does LyX handle this differently from "~" and "\"?

I'm not sure I understand this question.

And, as I've said, the Unicode version gives the wrong spacing.
In the GUI window or in the output?

Output. If you enter "text[emdash]text", you do not get a line break after the emdash. To get it, you have to mess around in ways you shouldn't have to mess around. I'm not worried about spacing around the em-dash. Though...

Spacing around an em-dash is both a font issue and a typographic one:

In German typography you put spaces around the "Gedankenstrich" (and
use an en-dash):

   Halbgeviertstrich, länger als das Divis, steht zwischen zwei
   Leerzeichen, außer in Verbindung mit Satzzeichen.

   -- typokurz – Einige wichtige typografische Regeln

I don't think people writing

   He came — and went.

will use correct spacing when told to use three hypens:

   He came---and went.

spacing is the same.

...it seems to me that this is *exactly* the kind of thing writers should not have to worry about, i.e., that ought to be handled automatically by the typesetting engine. If LaTeX doesn't do that, then there should be a package that does it, if there isn't one already. If there's no such package, then I'd be happy to consider special, language-specific handling of spacing around "---" and "--", if you wanted to propose that. But I would not want to have special spacing around the Unicode glyph, since it is just a character, and maybe I don't want special spacing around it: I meant to be able to use it as a normal character, if I want to do so.

And if the output is affected, the unicodesymbols file could replace
the em-dash with "\<tiny-space>\textemdash\<tiny-space{}"

It definitely should not do that, since that is completely wrong for English typography. In English, there are no spaces, but you can break after the emdash, though not before it. More importantly, the Unicode emdash should translate as an emdash and nothing else. Which is a reason to keep it as it is, since the translation as "---" can be wrong, as you have pointed out.

rh

Reply via email to