On Thu, 2007-12-06 at 14:51 +0100, Peter J. Holzer wrote:
> > 1. Don't do anything until there is an SMTP RFC which covers IDN.
> 
> You misunderstood what I wanted a proposal for. I did not want a
> proposal for IDN. I wanted one for a generalization of the parser so
> that plugins can be used to change the acceptable syntax.

Not really.  I object to both things ;-)  However, I will now read the
rest of your explanation on the second issue.  I doubt that I object
enough to make it worth another post. 

IDN is much worse.

Please see my follow-up under the topic eai[*].  Work on SMTP is barely
getting started but it appears that IDN and RFC 3490 are a collossal
mistake (imo, not at all h) and should be avoided except where
necessary.  I think it would be far better to start with the new draft
RFCs then to blindly implement RFC 3490.

There seems to be an ESMTP extension (possibly only proposed) for
allowing UTF-8 but this may just be for headers.  This is referenced in
the new eai drafts.

I don't know whether RFC 3490 must be used in the envelope.  Probably.
That will be extremely annoying as you will have to translate addresses
to compare.

I believe that the ACE (see below) adopted was probably DUDE (which
provides a one-to-one mapping to UTF-8, iirc -- some of the other
encodings proposed did not, i think).  That should be clear from reading
3490.

Bernstein demonstrated that in 2001 UTF-8 already worked in the envelope
for djbdns and qmail transparently and that trivial patches were
required for bind and sendmail.  Microsoft applications were already
able to use UTF-8.  Only after RFC 3490 was adopted, did microsoft (and
some others) change their apps.  So everything except DNS follows
RFC2277 (i may have the number wrong -- from memory it's all 2s and 7s),
which requires UTF-8 on the wire.  Hopefully RFC 3490 has some allowance
for a transition to UTF-8 ... some of the IDN WG suggested that
replacing DNS was a better idea.

See below for further amusement.

[*] Subject: Email Address Internationalization

--gh

Summary of the IDN WG email list.
---------------------------------

First of all, a few people on the IDN working group, including the
chair, insisted that an ACE solution (ASCII Character-set Encoding -- or
something like that) was absolutely required because of compatibility
with existing 7-bit applications.  In particular BIND and SENDMAIL.

Bernstein argued that there were existing trivial patches or workarounds
for BIND and SENDMAIL which would suffice.  He proposed mandating these
fixes with a reasonable time period allowed for transition.

The difference between these two stands is that the first avoids short
term pain for many sysadmins while the second avoids long term pain for
all programmers writing network apps (perhaps a smaller group than the
first).  IETF has a tradition of taking the first path and Bernstein is
quite vehement regarding the long term adverse consequences.

He also pointed to RFC 2277, which required all future protocols to
support UTF-8 on the wire.

As usual, Bernstein was accused of ad-hominem attacks ... I don't agree
with this assessment but I do agree that he is not particularly
diplomatic.  He did lose it completely at one point and used all caps
for 3 sentences.

He asked several times for a vote that there was a concensus that UTF-8
would be adopted eventually but his opponents claimed that he must write
an RFC proposal first and he never did that, as far as I can tell.  I
don't know if this is considered in 3490 ... it should, at least,
reference 2277 and justify non-adherence ... but I don't know until I
read it and I may not bother.



Reply via email to