ekr, Thanks for circling back to the WG with the consolidated list!
Cheers, spt > On Apr 22, 2021, at 13:31, Eric Rescorla <e...@rtfm.com> wrote: > > I have posted draft-ietf-tls-dtls13-42, addressing the IESG > Feedback. Thanks to everyone who provided reviews. Here is a > description how I handled comments. If there is somebody whose > feedback I missed please let me know. > > -Ekr > > **** Erik Kline > > > [ section 4.4 ] > > > > * "respectively" -> "respectively." > > > > * Could a DTLS implementation packetize to a min-MTU for an IP version > > and avoid all pMTU issues? Such a strategy would probably be poor for > > IPv4 but might be acceptable for IPv6 communications. > > Maybe, but I think we probably don't need to say much. > > > > [ section 4.5.3 ] > > > > * "MUST NOT used" -> "MUST NOT be used" > > Fixed. > > > [ section 5.8.4 ] > > > > * "NOT have send" -> "NOT send", I think > > Fixed. > > > [ section 6 ] > > > > * "which are needed to encrypt to decrypt"? > > Fixed. > > > **** Francesca Palombini > > > section 2. Conventions and Terminology > > > > FP: Please spell out that network byte order (most significant byte first) > > is used throughout the document. > > Done. > > > > > Once the client has transmitted the ClientHello message, it expects > > to see a HelloRetryRequest or a ServerHello from the server. > > However, if the server's message is lost, the client knows that > > either the ClientHello or the response from the server has been lost > > and retransmits. When the server receives the retransmission, it > > knows to retransmit. > > > > FP: It would be good to mention retransmission max times here. > > DTLS actually doesn't have an overall timeout. This is left to > the discretion of the implementation. It does have a maxmimum > backoff, butbackoff isn't mentioned at all. > > > > | | /+----------------+\ > > | 31 < OCT < 64 -+--> |DTLS Ciphertext | > > | | |(header bits | > > | else | | start with 001)| > > | | | /+-------+--------+\ > > > > The value for the "DTLS-OK" column is "Y". IANA is requested to > > reserve the content type range 32-63 so that content types in this > > range are not allocated. > > > > FP: IANA is asked to reserve 32-63, but I could not see any explanation for > > that. I would like to see it justified in section 4.1 or in the respective > > IANA section. > > Done. > > > > > > > fragmentation, clients of the DTLS record layer SHOULD attempt to > > size records so that they fit within any PMTU estimates obtained from > > the record layer. > > > > FP: First time PMTU appears, please expand and add reference. > > Done. > > > performing PMTU discovery, whether via [RFC1191] or [RFC4821] > > mechanisms. In particular: > > > > FP: I think this is missing areference to RFC 8201 since IPv6 is mentioned > > below. > > Done. > > > Any TLS cipher suite that is specified for use with DTLS MUST define > > limits on the use of the associated AEAD function that preserves > > margins for both confidentiality and integrity. That is, limits MUST > > be specified for the number of packets that can be authenticated and > > for the number of packets that can fail authentication before a key > > update is required. Providing a reference to any analysis upon which > > values are based - and any assumptions used in that analysis - allows > > limits to be adapted to varying usage conditions. > > > > FP: This seems important enough that it should be highlighted for the > > experts reviewing the registration. I see that > > https://www.iana.org/assignments/tls-parameters/tls-parameters.xhtml#tls-parameters-4 > > has a number of notes, maybe that would be enough, or maybe add it (as an > > update?) to RFC 8447? > > Done. > > > zero > > length vector (i.e., a single zero byte length field). > > > > FP: I suggest using TLS 1.3 terminology of "zero-length vector (i.e., a > > zero-valued single byte length field)" > > Done. > > > flow shown in Figure 11 if the client does not send the ACK message > > > > FP: s/11/12 > > Done. > > > ***** Martin Duke > > > COMMENTS: > > Sec 2. It might be useful to introduce the term "epoch" in the glossary, > > for those who read this front to back. > > Done. > > > > Sec 4.2.3: "The encrypted sequence number is computed by XORing the leading > > bytes of the Mask with the sequence number. Decryption is accomplished by > > the same process." > > > > The text is unclear if the XOR is applied to the expanded sequence number > > or merely the 1-2 octets on the wire. I presume it's the latter, but this > > should be clarified. > > Fixed. > > > > Sec 4.2.3: It's implied here that the sn_key rotates with the epoch. As > > this is different from QUIC, it's probably worth spelling out. > > Fixed. > > > > Sec 5.1 is a bit vague about the amplification limit; why not at least > > RECOMMEND 3, as we've converged on this elsewhere? > > Added. > > > > Sec 5.1. Reading between the lines, it's clear that the cookie can't be > > used as address verification across connections in the way that a NEW_TOKEN > > token is. It would be good to spell this out for clients -- use the > > resumption token or whatever instead. > > Added some text. > > > > Sec 7.2 "As noted above, the receipt of any record responding to a given > > flight MUST be taken as an implicit acknowledgement for the entire flight." > > I think this should be s/entire flight/entire previous flight? > > Added some text. > > > > Sec 7.2 "Upon receipt of an ACK that leaves it with only some messages from > > a flight having been acknowledged an implementation SHOULD retransmit the > > unacknowledged messages or fragments." > > > > This language appears inconsistent with Figure 12, where Record 1 has not > > been acknowledged but is also not retransmitted. It appears there is an > > implied handling of empty ACKs that isn't written down anywhere in Sec 7.2 > > This is just a bug in the diagram. Good catch. Fixed. > > > > > Sec 9. Should there be any flow control limits on new_connection_id? Or > > should receivers be free to simply drop CIDs they can't handle? It might be > > good to specify. > > Added some text. > > > > Finally, a really weird one. Reading this document and references to > > connection ID prompted to me to think how QUIC-LB could apply to DTLS. The > > result is here: https://github.com/quicwg/load-balancers/pull/106/files. > > Please note the rather unfortunate third-to-last paragraph. I'm happy to > > take the answer that this use case doesn't matter, since I made it up > > today. But if it does, it would be very helpful if (1) DTLS 1.3 clients > > MUST include a connection_id extension in their ClientHello, even if zero > > length, and/or (2) this draft updated 4.1.4 of 8446 to allow the server to > > include connection_id in HelloRetryRequest even if the client didn't offer > > it. Thoughts? > > Addressed. > > > > > NITS: > > 5.2 s/select(HandshakeType)/select(msg_type). Though with pseudocode your > > mileage may vary as to what's clearer. > > Agreed. > > > > 5.7 s/consitute/constitute > > Fixed. > > > > Sec 5.7 In table 1, why include one ACK in the diagram but not the other? > > It's clear from the note, but the figure is a weird omission. > > I don't think I understand why we did this, so I just removed it. > > > > **** Lars Eggert > > Indicated minor changes made. > > > **** Zaheduzzaman Sarker > > > This was very well written document. Thanks for this. > > > > Minor observations below- > > > > * Section 3.1 : > > - Once the client has transmitted the ClientHello message, it expects to > > see a HelloRetryRequest or a ServerHello from the server. However, if the > > server's message is lost, the client knows that either the ClientHello or > > the response from the server has been lost and retransmits. > > > > is this supposed to mean when the timer expires the client knows either the > > ClientHello or the response from the server has been lost? the current text > > does not imply that - the server's message is lost is an interpretation of > > timer expired event. > > > > Fixed. > > > > - The server also maintains a retransmission timer and retransmits when > > that timer expires. > > > > The way it is written following the previous paragraph, almost made me feel > > that the server is also maintaining a timer for the client hello. It would > > be nicer if some text explains the usage of timers at the server to break > > the continuous read from previous paragraph. > > Fixed. > > > > * Section 3.3: I would add a reference to section 4.4. > > Done. > > > > * Section 4.5.2: I assume the silent discard of invalid records will not > > impact the timers, is that a valid assumption? if yes, then it would be > > good if this is clarified in the text. > > This is correct, but I don't quite see why one would think it does, as they > don't even get to the point where you it would impact the timer. Anyway, > added some text. > > > > > * Section 5.8.1: > > > > Because DTLS clients send the first message (ClientHello), they start > > in the PREPARING state. DTLS servers start in the WAITING state, but with > > empty buffers and no retransmit timer > > > > This is repeated twice in the section, is there any reason for that? > > Fixed. > > > **** John Scudder > > COMMENTS: > > > Section 3.1: > > > > I found the explanatory text to be confusing. You start with a figure > > illustrating a lost HelloRetryRequest. Then you tell me the server > > maintains a rexmit timer: > > > > The server also maintains a retransmission timer and retransmits when > > that timer expires. > > > > But then you immediately tell me that it actually doesn’t: > > > > Note that timeout and retransmission do not apply to the > > HelloRetryRequest since this would require creating state on the > > server. The HelloRetryRequest is designed to be small enough that it > > will not itself be fragmented, thus avoiding concerns about > > interleaving multiple HelloRetryRequests. > > > > I presume that if I added some more words to this, your intent is that the > > server maintains a retransmission timer *for messages other than > > HelloRetryRequest*. As written, it gave me some whiplash. > > Fixed. > > > > Section 4.2.1: > > > > In general, > > implementations SHOULD discard records from earlier epochs, but if > > packet loss causes noticeable problems implementations MAY choose to > > retain keying material from previous epochs for up to the default MSL > > specified for TCP [RFC0793] to allow for packet reordering. > > > > It seems to me as though “if packet loss causes noticeable problems” is > > saying either too much, or not enough. Not enough: problems for whom? > > Noticeable by whom? How is this determined? Do you really mean I’m supposed > > to work this out dynamically as the text sort-of implies? Too much: if > > you’re not going to answer the foregoing, maybe don’t taunt me, and omit > > the clause entirely? Or, possibly a less vague rewrite could be in the > > nature of “if providing service to an application that is especially > > sensitive to packet loss”. > > Removed. > > > > NITS: > > > > Section 2: > > > > “The reader is also as to be familiar” s/as/assumed/ > > Fixed. > > > Section 11: > > > > Although the cookie must allow the server to produce the right > > handshake transcript, they > > > > “It” not “they” (agreement in number) > > Fixed. > > > and > > > > DTLS with connection IDs allow for endpoint addresses to > > change during the association; > > > > “allows” not “allow” (agreement in number) > > Fixed. > > > > **** Eric Vyncke > > > > > > -- Section 3 -- > > s/TLS cannot be used directly in datagram environments/TLS cannot be used > > directly over a datagram transport/ ? > > > > Bullet 2) s/to enable reassembly in the correct order/to enable reordering/ > > ? > > Fixed. > > > > -- Section 3.1 -- > > Should there be a hint to a maximum retry count ? > > I'm not sure what we would put here given the diversity of environments, > so we opted not to. > > > > > -- Section 3.3 -- > > I understand the motivation (and no need to reply), but, sigh... > > implementing frag/reassembly above the transport layer... > > Indeed. If it helps, think of DTLS as the transport layer :) > > > **** Robert Wilton > > 1) Although it is clear from the metadata, it might be helpful if the > > introduction also stated that it obsoletes DTLS 1.2. > > Done. > > > 2) This document is a set of deltas against TLS 1.3. Given that it > > talks about the DTLS 1.1/1.2 documents being deltas in the > > introduction, I would have also included that information for this > > document in the introduction rather than in the Terminology and > > Considerations section. Initially, having read the introduction I had > > assumed that it was not going to be deltas. > > Done. > > > **** Bernard Aboba > > > Summary: The timeout and retransmission scheme looks workable for common > > cases, but could use some refinement to make it more robust. > > > > Technical Comments > > > > 4.5.2. Handling Invalid Records > > > > Unlike TLS, DTLS is resilient in the face of invalid records (e.g., > > invalid formatting, length, MAC, etc.). In general, invalid records > > SHOULD be silently discarded, thus preserving the association; > > however, an error MAY be logged for diagnostic purposes. > > > > [BA] How does silent discard of invalid records interact with > > retransmission timers? > > It doesn't. How could it? But I added some text anyway. > > > > Implementations which choose to generate an alert instead, MUST > > generate error alerts to avoid attacks where the attacker repeatedly > > probes the implementation to see how it responds to various types of > > error. Note that if DTLS is run over UDP, then any implementation > > which does this will be extremely susceptible to denial-of-service > > (DoS) attacks because UDP forgery is so easy. Thus, this practice is > > NOT RECOMMENDED for such transports, both to increase the reliability > > of DTLS service and to avoid the risk of spoofing attacks sending > > traffic to unrelated third parties. > > > > [BA] "this practice" refers to "generate an alert instead", correct? > > Yes. Addressed, > > > > 5.8.2. Timer Values > > > > Though timer values are the choice of the implementation, mishandling > > of the timer can lead to serious congestion problems, for example if > > > > [BA] Saying "timer values are the choice of the implementation" seems > > odd, because it is followed by normative language. I would delete this > > and start the sentence with "Mishandling...". > > It has been deleted. > > > > many instances of a DTLS time out early and retransmit too quickly on > > a congested link. Implementations SHOULD use an initial timer value > > of 100 msec (the minimum defined in RFC 6298 [RFC6298]) and double > > the value at each retransmission, up to no less than 60 seconds (the > > RFC 6298 maximum). Application specific profiles, such as those used > > for the Internet of Things environment, may recommend longer timer > > values. Note that a 100 msec timer is recommended rather than the > > 3-second RFC 6298 default in order to improve latency for time- > > sensitive applications. Because DTLS only uses retransmission for > > handshake and not dataflow, the effect on congestion should be > > minimal. > > > > Implementations SHOULD retain the current timer value until a message > > is transmitted and acknowledged without having to be retransmitted, > > at which time the value may be reset to the initial value. > > > > [BA] Is it always possible to distinguish a retransmission from a late > > arrival of an original packet? This seems like it could result in > > wrongly resetting the timer in some situations. > > The intent of this text is that you didn't retransmit at all. > > > > 5.8.3. Large Flight Sizes > > > > DTLS does not have any built-in congestion control or rate control; > > in general this is not an issue because messages tend to be small. > > However, in principle, some messages - especially Certificate - can > > be quite large. If all the messages in a large flight are sent at > > once, this can result in network congestion. A better strategy is to > > send out only part of the flight, sending more when messages are > > acknowledged. DTLS offers a number of mechanisms for minimizing the > > size of the certificate message, including the cached information > > extension [RFC7924] and certificate compression [RFC8879]. > > > > [BA] How does the implementation know how much of the flight to send? > > Not sure how prevalent large certs are for DTLS (e.g. compared with the > > self-signed certs of WebRTC), > > but in EAP-TLS deployments, large certs have caused problems. > > The EAP-TLS cert document draft-ietf-emu-eaptlscert cites some additional > > mechanisms for reducing certificate sizes, such as draft-ietf-tls-ctls > > and [RFC6066] which defines the "client_certificate_url" > > extension which allows TLS clients to send a sequence of Uniform > > Resource Locators (URLs) instead of the client certificate. > > We added some text. > > > > 5.11. Alert Messages > > > > Note that Alert messages are not retransmitted at all, even when they > > occur in the context of a handshake. However, a DTLS implementation > > which would ordinarily issue an alert SHOULD generate a new alert > > message if the offending record is received again (e.g., as a > > retransmitted handshake message). Implementations SHOULD detect when > > a peer is persistently sending bad messages and terminate the local > > connection state after such misbehavior is detected. Note that > > alerts are not reliably transmitted; implementation SHOULD NOT depend > > on receiving alerts in order to signal errors or connection closure. > > > > [BA] For the fatal alert case, it does seem like retransmission would > > be a good idea; otherwise the peer can be left hanging. > > This has been the practice since DTLS 1.0, and there's no way to > ack them, so I don't think we should change no. > > > > Section 7.1 > > "Disruptions" such as reordering do not affect timers, correct? > > No. The timers are only on the sender side, so they kind of > can't. > > > > > ACKs SHOULD NOT be sent for these flights unless generating the > > responding flight takes significant time. > > > > What is "significant time"? > > Rewritten. > > > > Editorial Comments (NITs) > > > > Section 2 > > > > The reader is also as to be familiar with > > > > [BA] "as" -> "assumed" > > Fixed. > > > Section 3 > > > > The basic design philosophy of DTLS is to construct "TLS over > > datagram transport". Datagram transport does not require nor provide > > reliable or in-order delivery of data. The DTLS protocol preserves > > this property for application data. Applications such as media > > streaming, Internet telephony, and online gaming use datagram > > transport for communication due to the delay-sensitive nature of > > transported data. The behavior of such applications is unchanged > > when the DTLS protocol is used to secure communication, since the > > DTLS protocol does not compensate for lost or reordered data traffic. > > > > [BA] While low-latency streaming and gaming does use DTLS to protect data > > (e.g. for > > protection of WebRTC data channel), telephony and RTC Audio/Video uses > > DTLS/SRTP for > > key derivation only, and SRTP for protection of data. So you might want to > > make a > > distinction. > > Done. > > > Section 3.1 > > > > Note that timeout and retransmission do not apply to the > > HelloRetryRequest since this would require creating state on the > > server. The HelloRetryRequest is designed to be small enough that it > > will not itself be fragmented, thus avoiding concerns about > > interleaving multiple HelloRetryRequests. > > > > [BA] I would add "For more detail on timeouts and retransmission, > > see Section 5.8." > > Done. > > > 4.3. Transport Layer Mapping > > > > DTLS messages MAY be fragmented into multiple DTLS records. Each > > DTLS record MUST fit within a single datagram. In order to avoid IP > > fragmentation, clients of the DTLS record layer SHOULD attempt to > > size records so that they fit within any PMTU estimates obtained from > > the record layer. > > > > [BA] You might reference PMTU considerations described in Section 4.4. > > Done. > > > Post-handshake client authentication > > > > Messages of each category can be sent independently, and reliability > > is established via independent state machines each of which behaves > > as described in Section 5.8.1. For example, if a server sends a > > NewSessionTicket and a CertificateRequest message, two independent > > state machines will be created. > > > > As explained in the corresponding sections, sending multiple > > instances of messages of a given category without having completed > > earlier transmissions is allowed for some categories, but not for > > others. Specifically, a server MAY send multiple NewSessionTicket > > messages at once without awaiting ACKs for earlier NewSessionTicket > > first. Likewise, a server MAY send multiple CertificateRequest > > messages at once without having completed earlier client > > authentication requests before. In contrast, implementations MUST > > NOT have send KeyUpdate, NewConnectionId or RequestConnectionId > > > > [BA] "send" -> "sent" > > Changed. > > > Example of Handshake with Timeout and Retransmission > > > > The following is an example of a handshake with lost packets and > > retransmissions. Note that the client sends an empty ACK message > > because it can only acknowledge Record 1 sent by the server once it > > has processed messages in Record 0 needed to establish epoch 2 keys, > > which are needed to encrypt to decrypt messages found in Record 1. > > > > [BA] "encrypt to decrypt" -> "encrypt or decrypt"? > > Changed. > > > Section 7.3 > > > > In the first case the use of the ACK message is optional because the > > peer will retransmit in any case and therefore the ACK just allows > > for selective retransmission, as opposed to the whole flight > > retransmission in previous versions of DTLS. For instance in the > > flow shown in Figure 11 if the client does not send the ACK message > > > > [BA] Figure 11 is the DTLS State Machine. Are you referring to another > > figure? > > Fixed. > > > The use of the ACK for the second case is mandatory for the proper > > functioning of the protocol. For instance, the ACK message sent by > > the client in Figure 13, acknowledges receipt and processing of > > record 4 (containing the NewSessionTicket message) and if it is not > > sent the server will continue retransmission of the NewSessionTicket > > indefinitely until its transmission cap is reached. > > > > [BA] Do you mean "maximum retransmission timemout value"? > > Fixed. > _______________________________________________ > TLS mailing list > TLS@ietf.org > https://www.ietf.org/mailman/listinfo/tls _______________________________________________ TLS mailing list TLS@ietf.org https://www.ietf.org/mailman/listinfo/tls