On 12.11.2019 16:13, Nathan Hartman wrote:
> I'm guessing
> there was a desire to do more than print the offending hex codes but
> I don't know what else you could do.

If it's invalid UTF-8, there's nothing you can do.

There's a lot you can do if the UTF-8 is valid, but the target locale
can't represent the characters. Transliteration is a thing, but if we go
down that path, it's hard to decide when to stop. Subversion is not a
linguistics tool, it's a version control tool.

-- Brane

P.S.: I think there's even code that removes diacritical marks from the
source Unicode to make conversion to the target locale easier, at least
for Latin-based locales. That's already a kind of transliteration, and
even that may be way too much.

Reply via email to