[Lsr] Re: Consensus Call on LSR WG work on "Leaderless Flooding Algorithm for Distributed Flood Reduction to allow reduced configuration, minimal blast radius, and ease of incremental deployment"

Gunter van de Velde (Nokia) Tue, 19 Nov 2024 11:53:40 -0800

[speaking as WG participant]

Inline: GV>

From: Les Ginsberg (ginsberg) <ginsberg=40cisco....@dmarc.ietf.org>
Sent: Tuesday, November 19, 2024 7:59 PM
To: Gunter van de Velde (Nokia) <gunter.van_de_ve...@nokia.com>; Acee Lindem 
<acee.i...@gmail.com>; lsr <lsr@ietf.org>
Subject: RE: [Lsr] Re: Consensus Call on LSR WG work on "Leaderless Flooding 
Algorithm for Distributed Flood Reduction to allow reduced configuration, 
minimal blast radius, and ease of incremental deployment"

CAUTION: This is an external email. Please be very careful when clicking links 
or opening attachments. See the URL nok.it/ext for additional information.

Gunter –

I would probably disagree with just about every bullet point you have below. 😊

GV> I expected nothing else 😊
GV> The explicit question raised by Acee was “The question to be answered by 
this consensus call is whether or not we want to work on an additional 
mechanism just for distributed flooding reduction to allow for reduced 
configuration, minimal blast radius, and ease of incremental deployment. ”
GV> My answer (as participant) is “yes, we should”.

But that discussion is premature.

IMO what we need to agree on now is:

1)We do want to have a discussion on leader/leaderless
2)Such discussion should take place independent of any specific algorithm
3)If we agree on the above, then the distoptflood draft needs to be updated to 
be agnostic to the enablement mode.

GV> This was not asked in this consensus call. At this stage it sounds 
premature to discuss. I personally have no strong preference for separate 
documents or a single document. There are pro’s and con’s for each approach.

Once that is agreed upon we need to decide on a context (AKA draft) in which 
this discussion could take place.
A suggestion has been made that this could be done as an update to RFC 9667 – I 
like that idea. But there are other ways this could be done as well.

But I would like us to agree on the three points above first before diving inot 
the technical discussion.

GV> My feeling (as participant) is that a reasonable first step would be to 
understand if the WG find consensus to work on a leaderless architecture or 
not. If the WG decide it is not feasible, then the discussion ends and we 
should stop spending time on the subject.

G/

   Les

From: Gunter van de Velde (Nokia) 
<gunter.van_de_velde=40nokia....@dmarc.ietf.org<mailto:gunter.van_de_velde=40nokia....@dmarc.ietf.org>>
Sent: Tuesday, November 19, 2024 10:35 AM
To: Acee Lindem <acee.i...@gmail.com<mailto:acee.i...@gmail.com>>; lsr 
<lsr@ietf.org<mailto:lsr@ietf.org>>
Subject: [Lsr] Re: Consensus Call on LSR WG work on "Leaderless Flooding 
Algorithm for Distributed Flood Reduction to allow reduced configuration, 
minimal blast radius, and ease of incremental deployment"

[speaking as WG participant]

Hi Acee, All,

The current draft needs some work to be better readable. It is a non-trivial 
read. I am convinced that when there is consensus to develop the leaderless 
solution that this can be worked out accordingly. In my opinion, the greatest 
advantage of a leaderless technology architecture is its operational 
simplicity, particularly because it eliminates the need for a coordinated 
transition or "flag day."

A few benefits I see from a leaderless approach:

  *   Enhanced Redundancy: Multiple nodes participate in flooding, reducing the 
risk associated with any single node failure.
  *   Simpler Protocol Design: Eliminates the need for leader election and 
maintenance mechanisms, simplifying the protocol implementation.
  *   Better Scalability: Distributes the processing load across multiple 
nodes, preventing bottlenecks.
  *   Faster Convergence: Avoids delays associated with leader re-election, 
enabling quicker dissemination of routing updates.

Some concerns I see with a flooding leader approach:

  *   Single Point of Failure (the flooding leader becomes a critical component 
in the network. If the leader fails or becomes unreachable, it can disrupt the 
flooding process until a new leader is elected or the network converges to an 
alternative state)
  *   Increased Complexity (Implementing a flooding leader requires additional 
mechanisms for leader election, maintenance, and failure detection)
  *   Scalability Concerns (in large-scale or highly dynamic networks, managing 
a single flooding leader can become a bottleneck)
  *   Convergence Delays (when the flooding leader fails, the network must 
initiate a leader re-election process)
  *   Lack of Redundancy (relying on a single leader reduces redundancy in the 
flooding process)
  *   Overhead of Leader Maintenance (continuous monitoring is required to 
ensure the flooding leader is operational)
  *   Potential for Suboptimal Flooding Paths (the flooding leader may not 
always have the most efficient paths to all nodes, especially in dynamic 
topologies)
  *   Complex Recovery Mechanisms (recovering from leader failures may involve 
complex procedures that differ from standard link-state protocol operations)

I believe there is place for both a flooding leader and leaderless 
architecture. It depends upon type of network where this is implemented (for 
example Datacenter or Service Provider WAN).

Be well,
G/

From: Acee Lindem <acee.i...@gmail.com<mailto:acee.i...@gmail.com>>
Sent: Monday, November 18, 2024 3:40 PM
To: lsr <lsr@ietf.org<mailto:lsr@ietf.org>>
Subject: [Lsr] Consensus Call on LSR WG work on "Leaderless Flooding Algorithm 
for Distributed Flood Reduction to allow reduced configuration, minimal blast 
radius, and ease of incremental deployment"

CAUTION: This is an external email. Please be very careful when clicking links 
or opening attachments. See the URL nok.it/ext for additional information.

During the IETF 121 LSR meeting, we had a very good discussion of 
https://datatracker.ietf.org/doc/draft-ietf-lsr-distoptflood/

While the CDS flooding reduction algorithm itself was interesting, the main 
area of the discussion was distributed flooding reduction algorithm selection 
and its configuration and deployment.

Today, we have RFC 9667 “Dynamic Flooding on Dense Graphs” which provides 
leader-based supporting both centralized and distributed flooding reduction 
algorithm selection. With the mechanism, an area leader is selected from among 
the OSPF routers in the area supporting flooding reduction and this leader 
selects the flooding reduction algorithm for the area.

The question to be answered by this consensus call is whether or not we want to 
work on an additional mechanism just for distributed flooding reduction to 
allow for reduced configuration, minimal blast radius, and ease of incremental 
deployment.

Please send your support or opposition to this list before Tuesday, December 
3rd.

As WG Co-Chair,
Acee

_______________________________________________
Lsr mailing list -- lsr@ietf.org
To unsubscribe send an email to lsr-le...@ietf.org

[Lsr] Re: Consensus Call on LSR WG work on "Leaderless Flooding Algorithm for Distributed Flood Reduction to allow reduced configuration, minimal blast radius, and ease of incremental deployment"

Reply via email to