Re: 1.23: UTF-8 device produces mysterious characters

2022-09-14 Thread Dave Kemper
On 9/13/22, Steffen Nurpmeso wrote: > How about [12066c659ea454a663483a12181a0c33cf416b22]. > This looks promising. How about it? Your question is ambiguous. The commit you cite concerns changes to the man and mdoc macros, which Branden mentioned in his first reply in the thread. Are you using

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-14 Thread Steffen Nurpmeso
Ralph Corderoy wrote in <20220914071616.2e89021...@orac.inputplus.co.uk>: |Hi Steffen, | |>>> En dash would look nice, i could imagine. |>> |>> Those ASCII ‘-’ above should be rendered as a hyphen in nicely |>> typeset output. An en-dash is far too big. Oh, there's another |>> one! |...

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-14 Thread Ralph Corderoy
Hi Steffen, > > > En dash would look nice, i could imagine. > > > > Those ASCII ‘-’ above should be rendered as a hyphen in nicely > > typeset output. An en-dash is far too big. Oh, there's another > > one! ... > But i was talking -Tutf8, and these are fixed width font Given we use terminal em

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-13 Thread Dave Kemper
On 9/13/22, G. Branden Robinson wrote: >> Or look at the Unicode standard, where real great minds with >> incredible multi-national professional life careers are involved, >> get the official PDF (hr-hrm, i have not updated since Unicode >> 13..), combined words are separated with hyphen-minus, _n

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-13 Thread Steffen Nurpmeso
Dave Kemper wrote in : How about [12066c659ea454a663483a12181a0c33cf416b22]. This looks promising. Thank you. --steffen

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-13 Thread Dave Kemper
On 9/13/22, Steffen Nurpmeso wrote: > Dave Kemper wrote in > : > |Groff has converted an input U+002D to a proper hyphen in typeset > |output for decades. It has done so in UTF-8 output since at least > |groff 1.19.2. > > This is not true. I never have seen anything but hyphen-minus on > my

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-13 Thread Steffen Nurpmeso
Ralph Corderoy wrote in <20220913084152.0211621...@orac.inputplus.co.uk>: |Hi Steffen, | |> Hyphen is good at the end of line when a word is hyphenated, otherwise |> it is misplaced. | |Not in English. A hyphen may be used to join compound adjectives, as a |two-minute Google would show. :

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-13 Thread Steffen Nurpmeso
G. Branden Robinson wrote in <20220913053400.7lmdp2qpxb6zweei@illithid>: |At 2022-09-12T23:41:34+0200, Steffen Nurpmeso wrote: |> This is not a hyphenated word. |[rearranging this a bit] |> En dash would look nice, i could imagine. | |Then use en dashes in your input. | | on\[en]loop\[en]

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-13 Thread Steffen Nurpmeso
Dave Kemper wrote in : |On 9/12/22, Steffen Nurpmeso wrote: ... |Groff has converted an input U+002D to a proper hyphen in typeset |output for decades. It has done so in UTF-8 output since at least |groff 1.19.2. ... This is not true. I never have seen anything but hyphen-minus on my box

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-13 Thread Ralph Corderoy
Hi Steffen, > Hyphen is good at the end of line when a word is hyphenated, otherwise > it is misplaced. Not in English. A hyphen may be used to join compound adjectives, as a two-minute Google would show. :-) An ‘American-football player’ isn't necessarily American whereas an ‘American footbal

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-12 Thread G. Branden Robinson
Hi Steffen, At 2022-09-12T23:41:34+0200, Steffen Nurpmeso wrote: > This is not a hyphenated word. [rearranging this a bit] > En dash would look nice, i could imagine. Then use en dashes in your input. on\[en]loop\[en]main\[en] tick The "en" special character identifier is not portable back to

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-12 Thread Dave Kemper
On 9/12/22, Steffen Nurpmeso wrote: > Flags are different, because often you want this to be a U+2013 > EN DASH. Ie, you want to make it _longer_ than a hyphen-minus. Not if you want it to paste correctly into a shell. Commands use U+002D for flags, not U+2013. But for natural languages, U+002D

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-12 Thread Steffen Nurpmeso
Hello Branden. G. Branden Robinson wrote in <20220912144641.q2r65kkfpiej4u2u@illithid>: |At 2022-09-12T15:43:00+0200, Steffen Nurpmeso wrote: |> I have problems with the UTF-8 device, it shows |> |> on‐main‐loop‐tick |> instead of |> on-main-loop-tock |> |> ie U+2010 instead of hyph

Re: 1.23: UTF-8 device produces mysterious characters

2022-09-12 Thread G. Branden Robinson
Hi Steffen, At 2022-09-12T15:43:00+0200, Steffen Nurpmeso wrote: > I have problems with the UTF-8 device, it shows > > on‐main‐loop‐tick > instead of > on-main-loop-tock > > ie U+2010 instead of hyphen-minus U+002D. > > The above does not feel right, and searching is impossible! > I would e

1.23: UTF-8 device produces mysterious characters

2022-09-12 Thread Steffen Nurpmeso
Hello! I have problems with the UTF-8 device, it shows

  on‐main‐loop‐tick
instead of
  on-main-loop-tock

ie U+2010 instead of hyphen-minus U+002D. The above does not feel right, and searching is impossible! I would expect U+2010 HYPHEN in hyphenation, but not as a regular combiner aka delimiter
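
For readers who want to reproduce the search problem described in this thread, here is a minimal Python sketch. It is not part of groff and was not posted by anyone in the thread; the helper names `describe_dashes` and `to_hyphen_minus` and the sample string are made up for illustration. It only shows why a search typed with U+002D HYPHEN-MINUS misses text that was rendered with U+2010 HYPHEN, and one possible way to normalize captured output before searching.

```python
import unicodedata

HYPHEN = "\u2010"        # U+2010 HYPHEN, as seen in the rendered output
HYPHEN_MINUS = "\u002d"  # U+002D HYPHEN-MINUS, as typed on a keyboard


def describe_dashes(text):
    """List (code point, Unicode name) for every dash-like character.

    "Pd" is the Unicode general category "Punctuation, dash"; both
    U+2010 and U+002D belong to it.
    """
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch))
        for ch in text
        if unicodedata.category(ch) == "Pd"
    ]


def to_hyphen_minus(text):
    """Replace U+2010 with U+002D (hypothetical helper for searching)."""
    return text.replace(HYPHEN, HYPHEN_MINUS)


# Sample string standing in for text copied from a UTF-8 terminal.
rendered = "on\u2010main\u2010loop\u2010tick"

print(describe_dashes(rendered))
print("on-main-loop-tick" in rendered)                   # False
print("on-main-loop-tick" in to_hyphen_minus(rendered))  # True
```

This only post-processes text that has already been rendered. Whether the formatter should emit U+2010 or U+002D in the first place is the question the replies above discuss.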