Andre Poenitz wrote:

> Or slurp in the contents of the .tex file and try various encodings
> until we find one that "does the trick", possibly after cutting it
> into parts for which we know that the encoding stays constant.

Yes. Note that this can become quite tricky, though: Some variable width
encodings (shift-jis and big5 are examples supported by the CJK package)
are not as nice as utf8, and do not guarantee that the second or third byte
of a code point does not contain any byte from the ASCII range. That means
that you cannot preparse the file (for cutting it into pieces) without
knowing the encoding of the currently read piece.
For the same reason these encodings cannot be used with standard LaTeX
compilers, but AFAIK there are specialized variants that understand these
encodings.


Georg

Reply via email to