On Mon, May 14, 2007 at 06:25:14PM +0200, Stefan Richter wrote:
> John Anthony Kazos Jr. wrote:
> > You can't post a patch to UTF-8 in plain text, because it isn't plain
> > text.
>
> Of course it is, you said so...
>
> > MTAs and MUAs mangle the hell out of them. Look back in the archives
> > for my postings. Once you've got something in UTF-8, -*THEN*- you can post
> > in plain inline text.
>
> ...yourself.  Just send as
>
>     Content-type: text/plain; charset="utf-8"
>
> Of course a patch like that will usually contain several character sets,
> but that's no problem as long as the byte codes are preserved and we can
> easily see how the end result looks like.  The important thing is that
> the submitter's and committer's MUAs do it right and reviewers' MUAs do
> it sort of right.  Also, nobody cares if there is still some pre-WWI
> 7bit-only MTA operated in some lone desert town.  And if some web
> archive of LKML isn't compatible to internet mail, then that's tough luck.

The problem is that the unconverted characters are always invalid UTF-8 [1],
and it's 100% correct for a MUA to convert these invalid UTF-8 characters
to replacement characters - if a MUA claims to send valid charset="utf-8",
it is simply not allowed to preserve the byte codes.

> Stefan Richter

cu
Adrian

[1] except for some cases with two or more non 7bit ASCII characters in a
    row

--

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed
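
To illustrate the point about unconverted bytes, here is a minimal Python sketch. It assumes the unconverted text is ISO-8859-1 (Latin-1), which the thread does not state, and the sample byte strings and the helper name is_valid_utf8 are made up for the example. It shows a lone non-ASCII Latin-1 byte failing UTF-8 validation and being turned into U+FFFD by a lenient decoder, and a two-byte run that happens to be well-formed UTF-8, as in footnote [1].

```python
def is_valid_utf8(raw: bytes) -> bool:
    """Return True if raw decodes as strict UTF-8 (hypothetical helper)."""
    try:
        raw.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Assumed sample: "Mueller" with u-umlaut, encoded as Latin-1.
# The single byte 0xFC can never start a UTF-8 sequence.
lone = b"M\xfcller"

# Assumed sample: the Latin-1 characters 0xC3 0xA9 in a row.
# Together they happen to form the valid UTF-8 encoding of U+00E9.
pair = b"\xc3\xa9"

lone_ok = is_valid_utf8(lone)
pair_ok = is_valid_utf8(pair)

# What a decoder that substitutes replacement characters produces.
replaced = lone.decode("utf-8", errors="replace")

print(lone_ok, pair_ok, replaced)
```

The second case decodes without error but to a different character than the sender meant, so passing validation does not mean the text survived intact.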