On Tue, Oct 29, 2019, at 14:20, Barry Leiba wrote:
> Jim, thanks for accepting most of my edits and for addressing my 
> question about the password characters. It just seems odd to me that 
> you can't use << my#x20#x20#x20pw >> as a password, but you can use, 
> say, << my#x20#xA0#x20pw >> (or any of a number of other variations 
> with other Unicode space characters. But the reason sounds... 
> reasonable, so: post a rev and I'll last-call it.

As a data point, the current NIST recommendations on passwords:
https://pages.nist.gov/800-63-3/sp800-63b.html#memsecretver

Verifiers SHALL require subscriber-chosen memorized secrets to be at least 8 
characters in length. Verifiers SHOULD permit subscriber-chosen memorized 
secrets at least 64 characters in length. All printing ASCII [RFC 20] 
characters as well as the space character SHOULD be acceptable in memorized 
secrets. Unicode [ISO/ISC 10646] characters SHOULD be accepted as well. To make 
allowances for likely mistyping, verifiers MAY replace multiple consecutive 
space characters with a single space character prior to verification, provided 
that the result is at least 8 characters in length. Truncation of the secret 
SHALL NOT be performed. For purposes of the above length requirements, each 
Unicode code point SHALL be counted as a single character.
</quote>

Note the "MAY replace multiple consecutive space characters".

Also, as I hinted in the past during discussion on this extension, the whole 
area of "internationalized" identifiers,
such as usernames and passwords has been explored in the PRECIS ("Preparation, 
Enforcement, and Comparison of Internationalized Strings in Application 
Protocols") set of RFCs,
and specifically RFC 8265 "Preparation, Enforcement, and Comparison of 
Internationalized Strings
Representing Usernames and Passwords".
In short it defines an "OpaqueString" profile, to be applied for passwords.
Among other things it has this:
.  Additional Mapping Rule: Any instances of non-ASCII space MUST be
       mapped to SPACE (U+0020); a non-ASCII space is any Unicode code
       point having a Unicode general category of "Zs", with the
       exception of SPACE (U+0020).  As was the case in RFC 4013, the
       inclusion of only SPACE (U+0020) prevents confusion with various
       non-ASCII space code points, many of which are difficult to
       reproduce across different input methods.

(note also RFC8264 ยง5.3 for a discussion about spaces)

I am still of the opinion that we should have used/referenced that work more.


-- 
  Patrick Mevzek
  p...@dotandco.com

_______________________________________________
regext mailing list
regext@ietf.org
https://www.ietf.org/mailman/listinfo/regext

Reply via email to