Tony-P,

I am not talking about LAGs but pure vanilla L3 ECMP paths in any DC.

Any node will be receiving at least two copies of flooded topology
(regardless in which direction you look up or down) so when one of the
links is broken which is used for actual flooding or peer sending link
state info goes down flooding graph can be repaired at its own pace while
data plane is not impacted as there are many more L3 paths still available.

That was my point.

Thx,
r.

On Tue, Mar 5, 2019 at 8:36 PM Tony Przygienda <[email protected]> wrote:

>
>
> On Tue, Mar 5, 2019 at 11:12 AM Robert Raszuk <[email protected]> wrote:
>
>>
>> > Slow convergence is obviously not a good thing
>>
>> Could you please kindly elaborate why ?
>>
>> With tons of ECMP in DCs or with number of mechanism for very fast data
>> plane repairs in WAN (well beyond FRR) IMHO any protocol *fast convergence*
>> is no longer a necessity. Yet many folks still talk about it like the only
>> possible rescue ...
>>
>>
>>
> Agree on FRR in WAN but it seems quite a complexity to drag that into DC
> fabrics (again, depeneding what topologies talk about) especially due to
> the effect I outline below.
>
> ECMP in DCs when used AFAIS is often caused by walking the 10G-50G curve
> where everybody has different economic sweet-spots AFAIS. Sometimes 100G.
> It's often LAG'ed (due to amount of control plane state AFAIU). LAG'ing
> generally creates interesting problems itself and I expect whole thing to
> settle somewhere with 20-50G to the server over either very short copper or
> multimode. I do not think it's a particularly cool architecture to suggest
> "LAG every link so it's 1:1 protected".
>
> Wide fanout is common & will get wider AFAIS. This is cool for failures in
> one direction but doesn't prevent you from blackholing on the "far end"
> until you get control plane reconvergence (or do FRR on a spine but
> honestly, how do you do that, bow-tieing over top of fabric or doing
> harmonica routing over multi-homed servers?)
>
> Then, slow convergence means that new stuff shows up slowly which is
> probably not very desirable and in case of failures that really need
> control plane rerouting causes longer duration blackholes as well. Both
> seem undesirable .. Not mentioning mobility even ...
>
> my
>
> [image: CodeCogsEqn(17).gif]   cents ;-)
>
> --- tony
>
>
>
_______________________________________________
Lsr mailing list
[email protected]
https://www.ietf.org/mailman/listinfo/lsr

Reply via email to