#3608: Character Š wrong in index and status on Mac OS X
--------------------+-------------------------------------------------------
 Reporter:  kolcon  |       Owner:  me      
     Type:  defect  |      Status:  accepted
 Priority:  minor   |   Milestone:  1.6     
Component:  mutt    |     Version:  1.5.21  
 Keywords:          |  
--------------------+-------------------------------------------------------
Changes (by me):

  * owner:  mutt-dev => me
  * status:  new => accepted
  * milestone:  => 1.6


Comment:

 It appears the problem is that the rfc822 parsing code is not unicode-
 aware.  In particular, the next_token() function using ISSPACE, and what
 is likely happening is that the second byte of the Š character is being
 stripped, which causes mutt_paddstr() to substitute the replacement char �
 when mbrtowc() fails.

 Since non-ascii characters are not permitted to appear in header fields,
 this is normally not a problem (and if it is, we blame the sender's broken
 software for not rfc2047-encoding it).

 However, in this case, the user's muttrc has an alias with utf-8.  And
 this is probably similar to the situation in #2956 when the user edits the
 message header, because we are not cruel enough to let the user directly
 edit the RFC2047 encoding.

 More thought is required here.

-- 
Ticket URL: <http://dev.mutt.org/trac/ticket/3608#comment:11>
Mutt <http://www.mutt.org/>
The Mutt mail user agent

Reply via email to