#3608: Character Š wrong in index and status on Mac OS X --------------------+------------------------------------------------------- Reporter: kolcon | Owner: me Type: defect | Status: accepted Priority: minor | Milestone: 1.6 Component: mutt | Version: 1.5.21 Keywords: | --------------------+------------------------------------------------------- Changes (by me):
* owner: mutt-dev => me * status: new => accepted * milestone: => 1.6 Comment: It appears the problem is that the rfc822 parsing code is not unicode- aware. In particular, the next_token() function using ISSPACE, and what is likely happening is that the second byte of the Š character is being stripped, which causes mutt_paddstr() to substitute the replacement char � when mbrtowc() fails. Since non-ascii characters are not permitted to appear in header fields, this is normally not a problem (and if it is, we blame the sender's broken software for not rfc2047-encoding it). However, in this case, the user's muttrc has an alias with utf-8. And this is probably similar to the situation in #2956 when the user edits the message header, because we are not cruel enough to let the user directly edit the RFC2047 encoding. More thought is required here. -- Ticket URL: <http://dev.mutt.org/trac/ticket/3608#comment:11> Mutt <http://www.mutt.org/> The Mutt mail user agent