[Gen-art] Gen-ART Telechat review of draft-ietf-anima-grasp-api-08

Paul Kyzivat Mon, 30 Nov 2020 18:06:56 -0800

I am the assigned Gen-ART reviewer for this draft. The General AreaReview Team (Gen-ART) reviews all IETF documents being processed by theIESG for the IETF Chair. Please wait for direction from your documentshepherd or AD before posting a new version of the draft. For moreinformation, please see the FAQ at <http://wiki.tools.ietf.org/area/gen/trac/wiki/GenArtfaq>.


Document: draft-ietf-anima-grasp-api-08
Reviewer: Paul Kyzivat
Review Date: 2020-11-30
IETF LC End Date: 2020-10-28
IESG Telechat date: 2020-12-01


Summary:

This draft is on the right track but has open issues, described in thereview.


General:

This document has addressed some of the concerns I had during the lastcall review. However some of my concerns remain and some new ones havearisen in this version.


Issues:

Major: 3
Minor: 6
Nits:  1

1) MAJOR: Negotiation

The text in section 2.3.5 now makes clear that the sequence of steps inthe negotiation is non-deterministic - both sides can callnegotiate_step and negotiate_wait. I believe this can result in the twosides not agreeing on what values have been negotiated. (For instance,what if one side calls negotiate_step concurrently with the other sidecalling end_negotiate? Which value has been agreed upon?) The loop_countadds to the confusion. Are the two sides intended to have independentloop count values? It seems these too can become unsynchronized.

Also, the goal of negotiation isn't clear to me. I gather it must be forthe two sides to agree on a particular value for the objective. But forthat to work there must be some rules about how values can change ineach step so that the result stabililizes, rather than causing a battlethat ends with loop count exhaustion. This could be achieved by alwaysnegotiating *down*, or always *up*. But that requires that the objectivevalue type have an ordering function. Given the general nature of theobjective I don't think that can be assumed.

ISTM that more work is needed to define the negotiation process in a waythat ensures it ends with both sides agreeing on a single value for theobjective.


2) MINOR: Dry Run Negotiation

Dry Run negotiation is very under-specified. Why would it be used? Iguess that an ASA might use dry run negotiation to inform future actualnegotiation. Can anything be inferred from a dry run negotiation abouthow an actual negotiation will go? When participating in a dry runnegotiation, how should an ASA decide what response to make? Should ittake into account current resource availability? Or should it respondbased on best-case or worst-case resource availability? Or what?


This requires further clarification.

3) MAJOR: Confusing semantics of 'request_negotiate'

In section 2.3.5 I don't understand the following:

         1.  The 'session_nonce' parameter is null.  In this case the
             negotiation has succeeded in one step and the peer has
             accepted the request.  The returned 'proffered_objective'
             contains the value accepted by the peer, which is therefore
             equal to the value in the requested 'objective'.  For this
             reason, no session nonce is needed, since the session has
             ended.

IIUC this requires a network exchange with the peer. I don't see howthis can complete *immediately*. ISTM that this could only completeimmediately if it were satisfied from a local cache. That doesn't seemappropriate for this function.

Similarly, in bullet 2 I don't see how the proffered_objective would beavailable in the initial call, before a response has been received fromthe peer..

Does "immediately" here simply mean that the negotiation is completed inone exchange between the two ends? If so, isn't a session nonce stillrequired in an event loop implementation in order to handle the oneresponse?


Bullet 2 also says:

             ... The
             returned 'proffered_objective' contains the first value
             proffered by the negotiation peer.  The contents of this
             instance of the objective must be used to prepare the next
             negotiation step (see negotiate_step() below) because it
             contains the updated loop count, sent by the negotiation
             peer.  The GRASP code automatically decrements the loop
             count by 1 at each step, and returns an error if it becomes
             zero.

I guess that the 'proffered_objective' in the return parameters is thecounter-offer to the objective passed in the call. And that you expectthe objective value used in any subsequent negotiate_step to be derivedby modifying this value. So far this new wording has improved myunderstanding.

But the loop_count in the objective is especially confusing. It seemsthat it is handled quite differently from the rest of the objective. Youspecify (in 2.3.2.3) that it has a default value of GRASP_DEP_LOOPCT.But who is expected to initialize this? (Is it simply that the ASPshould use this value if it doesn't have any particular preference?)

Then you say that the GRASP decrements this. Is this decrementing doneon the calling side before sending the message, the calling side afterreceiving the response? Or by the peer, on receipt or when sending theresponse? Is it permissible for the ASA to modify this value duringnegotiation? Since this seems intended to prevent a loop, having clarityabout how this value is managed seems important.


4) MINOR: negotiate_wait

The negotiate_wait call allows one ASA to extend the timeout of anotherASA. This could, in perverse cases, cause an ASA to wait indefinitely.ISTM that this is dangerous. I would think it better make the other ASAaware of the desire to extend the timeout and let it decide whether todo so.


5) MAJOR: Consistency of Objective definitions

In section 2.3.2.3 and elsewhere, presumably all parties that use aparticular objective must agree on the values of synch, neg, dry, andthe size and structure of the value.

There is no communication of the size and structure in the abstract API.Presumably the implementation of a language binding to the API isrequired to at least communicate the size and alignment requirements tothe core. The matching of definitions between nodes must be achievedsolely by the name, the respective language bindings at the two ends,and out of band mutual agreement. Furthermore, different languagebindings may use different in-memory representations of the value. Insuch cases, how is the on-wire format to be determined?

If the two ends disagree on size and structure then problems will occur.Perhaps the core can identify size mismatches based on size communicatedon the wire vs the size defined by the language binding, but there areno error codes defined for this situation. And of course differingstructures with the same size would not be detectable.

Furthermore, there is potential for different ASAs to (accidentally)have incompatible definitions for the same objective. What happens inthis case? How can blame be ascribed so that the problem can be fixed?

IMO more needs to be said about all of this. At the least a number ofdisclaimers that put the burden on the ASAs to recognize the risk, takethese potential problems into account and avoid them. But there could besome requirements placed on API language bindings and coreimplementations to deal with some of these. And probably some addederror codes to report what problems can be detected.


6) MINOR/MAJOR: Session State

I continue to find the lifetime and state of a session to be unclear.The API calls that return session_nonce seem to signal creation of a newsession. The end_negotiate() call seems to terminate a negotiationsession. But what causes other sessions to end? This seems importantbecause there is state associated with a session that consumes resourcesand can't be reclaimed until the session ends. So it should be importantfor the ASA to end all sessions. Some clarification of this seemsimportant both for core implementors and for ASA developers that will beusing the API.

(Or is this document only for implementors of core and thoseinstantiating a particular language binding of the API, withdocumentation for end users left to others?)


7) MINOR/MAJOR: Timeout

Section 2.3.2.2 indicates that the API returns an error response to theASA if the timeout expires. But the other end is presumably stillworking on the request and will eventually send a response. What doesthe core do when it receives this? Must it retain state so that it candetect the case and ignore the message? It seems that this could resultin the two peers disagreeing on some state.


8) MINOR: Text regarding "minimum_TTL"

There is a small problem with the following in section 2.3.4:

      -  If the parameter 'minimum_TTL' is greater than zero, any
         locally cached locators for the objective whose remaining time
         to live in milliseconds is less than or equal to 'minimum_TTL'
         are deleted first.  Thus 'minimum_TTL' = 0 will flush all
         entries.

The first sentence qualifies the paragraph to cases where minimum_TTL isgreater than zero. But the final sentence then infers the behavior whenminimum_TTL is equal to zero.

Also, minimum_TTL is typed as an integer, which permits negative values.I gather that negative values are not allowed. I can suggest two ways tofix this:


      -  The parameter 'minimum_TTL' MUST be greater than or equal to
         zero. Any locally cached locators for the objective whose
         remaining time to live in milliseconds is less than or equal to
         'minimum_TTL' are deleted first.  Thus 'minimum_TTL' = 0 will
         flush all entries.

Or, change they type to unsigned integer. Then the statement can besimplified by removing the first sentence:


      -  Any locally cached locators for the objective whose remaining
         time to live in milliseconds is less than or equal to
         'minimum_TTL' are deleted first.  Thus 'minimum_TTL' = 0 will
         flush all entries.

9) MINOR: Terminology - Session nonce

The new first paragraph of section 2.2.3 talks about identifying thesession by a pseudo-random session identifier, and tagging it with an IPfor further uniqueness. The 2nd paragraph talks about a session_nonce.It isn't clear at this point in the text if these the same thing. Or isthe session id shared on the wire, the IP tag added by the core, and thesession_nonce an artifact of the API, shared only between the ASA andthe core?

Section 2.3.2.7 seems to confirm that the nonce is just an identifierused between the core and the ASA. But here it says that using the idplus the IP is simply one possible implementation choice.

Further, I question whether "nonce" is the best term to use here. ISTMthat "handle" (session_handle) would more clearly reflect the purpose ofthis item.

I think it would be helpful to be clearer in distinguishing what isfundamental vs what is implementation choice. For instance, in section2.2.3:


   A GRASP session consists of a finite sequence of messages (for
   discovery, synchronization, or negotiation) between a pair of ASAs.
   The core identifies it on the wire by a pseudo-random session
   identifier. Further details are given in [I-D.ietf-anima-grasp].

   On the first call in a new GRASP session, the API returns a
   'session_handle' value used to identify the session. This
   value must be used in all subsequent calls for the same session, and
   will be provided as a parameter in the callback functions.  By this
   mechanism, multiple overlapping sessions can be distinguished, both
   in the ASA and in the GRASP core.  The value of the 'session_handle"
   is opaque to the ASA.

This establishes the role and relationship of the two terms, whilesection 2.3.2.7 gives a possible implementation without as muchconfusion. (It will require some rewording to switch from session_nonceto session_handle. It already uses "session handle" in passing.)


10) NIT: Terminology - ASA nonce

For similar reasons to those above for session_nonce/session_handle, IMOit would be clearer to use asa_handle rather than asa_nonce. But this isonly a suggestion.


_______________________________________________
Gen-art mailing list
Gen-art@ietf.org
https://www.ietf.org/mailman/listinfo/gen-art

[Gen-art] Gen-ART Telechat review of draft-ietf-anima-grasp-api-08

Reply via email to