On 12.11.2019 16:13, Nathan Hartman wrote: > I'm guessing > there was a desire to do more than print the offending hex codes but > I don't know what else you could do.
If it's invalid UTF-8, there's nothing you can do. There's a lot you can do if the UTF-8 is valid, but the target locale can't represent the characters. Transliteration is a thing, but if we go down that path, it's hard to decide when to stop. Subversion is not a linguistics tool, it's a version control tool. -- Brane P.S.: I think there's even code that removes diacritical marks from the source Unicode to make conversion to the target locale easier, at least for Latin-based locales. That's already a kind of transliteration, and even that may be way too much.

