On 02/05/06, Peter Ruskin <[EMAIL PROTECTED]> wrote:
On Tuesday 02 May 2006 21:57, Alexander Skwar wrote:
> Hello.
>
> On my system, I've got some files, in which "special characters"
> (eg. German umlaute, like äöü or ß) are encoded in a rather
> strange way.
>
> Eg.:
> [2005-04-14 20-17-09] (0766) Sandra mit Cassandra auf Arm,
> Klinikum Barmen, Tag 23 - Schwarz-Wei�:009f.jpg
>
> "Schwarz-Wei�:009f" should read "Schwarz-Weiß".
>
> What I don't understand is, why the "ß" has been (wrongly)
> encoded as �:009f.
>
> Why is that happening?
>
Probably something with UTF-8. From gucharmap:
ß
U+00DF LATIN SMALL LETTER SHARP S
General Character Properties
Unicode category: Letter, Lowercase
Various Useful Representations
UTF-8: 0xC3 0x9F
Octal escaped UTF-8: \303\237
Decimal entity reference: &#223;
Annotations and Cross References
Alias names:
• Eszett
Notes:
• German
• uppercase is "SS"
• in origin a ligature of U+017F LATIN SMALL LETTER LONG S and
U+0073 LATIN SMALL LETTER S
See also:
• U+03B2 GREEK SMALL LETTER BETA
Nope, it's definitely not the Greek lower case beta.
The above assumes that you have already built support for the correct
language and character set into your kernel and that you have chosen an
appropriate character set for your console.
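
For what it's worth, the gucharmap values above can be reproduced with a
few lines of Python. This is only a sketch: the variable names are made up
for illustration, and the idea that the file name was produced by reading
the UTF-8 bytes as Latin-1 is an assumption, not something confirmed in
this thread. If that is what happened, the second byte, 0x9F, would turn
into the control character U+009F, which would at least be consistent with
the ":009f" in the file name.

```python
# Sketch: reproduce the gucharmap values for U+00DF and show what the
# UTF-8 bytes look like when misread as Latin-1 (assumed cause, unverified).
ch = "\u00df"  # LATIN SMALL LETTER SHARP S

# Encode the single character as UTF-8; this yields two bytes.
utf8 = ch.encode("utf-8")

# The same representations gucharmap lists.
hex_bytes = " ".join("0x%02X" % b for b in utf8)
octal_escaped = "".join("\\%03o" % b for b in utf8)
decimal_entity = "&#%d;" % ord(ch)

# Misinterpret the two UTF-8 bytes as two separate Latin-1 characters.
misread = utf8.decode("latin-1")
misread_codepoints = ["U+%04X" % ord(c) for c in misread]

print(hex_bytes)           # 0xC3 0x9F
print(octal_escaped)       # \303\237
print(decimal_entity)      # &#223;
print(misread_codepoints)  # ['U+00C3', 'U+009F']
```

U+009F is a non-printable C1 control character, so a tool that escapes
unprintable characters could plausibly write it out as a hex code instead.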
HTH.
--
Regards,
Mick
--
gentoo-user@gentoo.org mailing list