John Bauman wrote:
> UTF-8 shouldn't need a BOM, as it is designed for character streams, and
> there is only one logical ordering of the bytes. Only UTF-16 and greater
> should output a BOM, AFAIK.

Yes and no. Yes, UTF-8 does not need a BOM to identify endianness. No, usage of the BOM with UTF-8 is explicitly allowed in the Unicode specs (so output of the BOM doesn't *have* to be restricted to UTF-16 and greater), and the BOM has a well-defined meaning for UTF-8 (namely, as the UTF-8 signature).

Regards,
Martin
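
For illustration only, here is a minimal sketch of the distinction drawn above, assuming a current Python 3 interpreter and its standard "utf-8-sig" codec. The sample string and the variable names are made up for the example; they do not come from the thread.

```python
import codecs

# Hypothetical sample text; any string would do.
text = "h\u00e9llo"

# Plain UTF-8 writes no signature.
plain = text.encode("utf-8")

# "utf-8-sig" prepends the three-byte UTF-8 signature (EF BB BF).
signed = text.encode("utf-8-sig")
assert codecs.BOM_UTF8 == b"\xef\xbb\xbf"
assert signed == codecs.BOM_UTF8 + plain

# Decoding with "utf-8-sig" strips the signature if present and
# also accepts input that has none.
assert signed.decode("utf-8-sig") == text
assert plain.decode("utf-8-sig") == text

# Decoding with plain "utf-8" keeps the signature as U+FEFF.
assert signed.decode("utf-8") == "\ufeff" + text

# For UTF-16 the BOM really does carry byte-order information:
# the two byte orders produce different bytes for the same text.
utf16_le = text.encode("utf-16-le")
utf16_be = text.encode("utf-16-be")
assert utf16_le != utf16_be
assert codecs.BOM_UTF16_LE == b"\xff\xfe"
assert codecs.BOM_UTF16_BE == b"\xfe\xff"
```

So in this sketch the UTF-8 signature only marks the data as UTF-8, while the UTF-16 BOM also tells the reader which byte order was used.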