Re: Encoding italic (was: A last missing link)

Victor Gaultney via Unicode Wed, 16 Jan 2019 03:44:15 -0800

James Kass wrote:

Concerns about statefulness in plain-text exist. Treating "italic" asan opening/closing "punctuation" may help get around such concerns.IIRC, it was proposed that the Egyptian cartouche be handled that way.

I do appreciate the technical issues surrounding statefulness and userexpectation when they select, copy, and paste. However that has alwaysbeen an issue. The Latin script (and many others) already has 'states',and that is reflected in the encoding of the markers that indicate thebeginning and end of those states (parens, quotes, etc.). In the Latinscript those markers are visually represented as separate glyphs,although sometimes enterprising font makers will use OpenType orGraphite to adjust those glyphs in context.

Encoding 'begin italic' and 'end italic' would introduce difficultieswhen partial strings are moved, etc. But that's no different than withcurrent punctuation. If you select the second half of a string thatincludes an end quote character you end up with a mismatched pair, withthe same problems of interpretation as selecting the second half of astring including an 'end italic' character. Apps have to deal with it,and do, as in code editors.

Apps (and font makers) can also choose how to deal with presentingstrings of text that are marked as italic. They can choose to presentvisual symbols to indicate begin/end, such as /this/. Or they canpresent it using the italic variant of the font, if available. Yes thatbrings up the issue of what to do if no italic counterpart is there. Butthat's already an issue with people using math characters forpseudo-italic. I'd guess that far, far more fonts in the world haveitalic counterparts than contain math chars, and the trend toward alwayshaving roman/italic matched pairs is something I've established in myresearch interviews.


Treating italic like punctuation is a win for a lot of people:

- Users get their italic content preserved in plain text

- Those who develop plain text apps (social media in particular) don'thave to build in a whole markup/markdown layer into their apps


- Misuse of math chars for pseudo-italic would likely disappear

- The text runs between markers remain intact, so they need no specialtreatment in searching, selecting, etc.

- It finally, and conclusively, would end the decades of the mess inHTML that surrounds <em> and <italic>.

My main point in suggesting that Unicode needs these characters is thatitalic has been used to indicate specific meaning - this text is somehowspecial - for over 400 years, and that content should be preserved inplain text.

Re: Encoding italic (was: A last missing link)

Reply via email to