I'm doing something that's got to be rare if it's been done before: I'm dictating a groff document.
The dictation program lets me say things like "mdash", and the output is UTF-8 (so says file(1)). I prefer to work in ASCII. preconv doesn't do what I think I want. It produces outputs that aren't highly mnemonic e.g. \[u00E2]\[u0080]\[u0093] I'm not sure what that's supposed to be, but what I'd like to see is \[em] I'm therefore look for (if you will) "ffort": something that converts machine-readable codes in to human readable codes. Looking at the groff_char manual, it seems like such a translation is possible, even if many Unicode points map to the same groff symbol name. My questions: 1. Does such a utility exist? 2. Am I the only one who wishes for such a thing? 3. Is the problem well specified? Do you see any ambiguity in the proposed translation? It doesn't seem terribly difficult, but maybe I'm overlooking something. Thank you. --jkl