On Mon, Feb 10, 2025 at 04:54:24PM +0000, Gavin Smith wrote:
> On Mon, Feb 10, 2025 at 03:46:27PM +0000, Werner LEMBERG wrote:
> > 
> > >> And Japanese people are probably even offended because, as it
> > >> currently happens, almost all Japanese characters are represented
> > >> with Chinese syllables...
> > > 
> > > In addition to --no-transliterate-file-names, it is also possible to
> > > set the customization variable USE_UNIDECODE to 0, [...]
> > 
> > Japanese cross-reference labels detected while
> > `--transliterate-file-names` is active.  Please read section 'XXX' in
> > the manual how this influences the naming of split HTML files.
> > ```
> 
> If we turned off file name transliteration by default we wouldn't
> have any of these problems.

At least I can't see another way to avoid clashes in file names.  One
thing to think about is that the file names are truncated to 250
characters and non-transliterated file names could be longer in some
cases, therefore turning off transliteration by default could lead to
more cases of file name clashes for long non-ASCII node/anchor names
with the same 240 protected name beginning (48 characterss if no ASCII
at all and no spaces).  Transliterated could allow twice more characters
in favorable cases, but less than no transliteration in unfavourable
cases such as some scripts with long transliterations, or
transliterations to symbols that are afterwards escaped.

-- 
Pat

Reply via email to