Hi Lucy,

It is certainly true that the BGP route distribution mechanism is not optimal for multicast signaling. The advantages and disadvantages of using BGP for multicast signaling were discussed extensively in the WG when RFCs 6513 and 6514 were being written. But the entire mechanism is built around the standard BGP route distribution procedures, and one has to be very cautious about making changes.

> [Lucy] What you say is that, in BGP distribution mechanism, BGP
> (child here) chooses the bestpath (i.e. parent here) for a particular
> NLRI, then distributes the NLRI to all BGP peers (including the
> parent node).

Right.  But ...

One thing you have to remember is that the child might not have a BGP session directly with the parent. For instance, one can have a deployment where ingress replication is used in a non-segmented manner, such that the ingress PE makes a copy for each egress PE. In this case, when an egress PE originates a Leaf A-D route, the RT will identify the ingress PE. But ingress and egress PEs generally don't have BGP sessions to each other. In many deployments, the ingress and egress PEs are clients of a route reflector; the egress PE sends the Leaf A-D route to the RR, and the RR redistributes it to the ingress PE. In other deployments, the Leaf A-D route from a given egress PE may have to travel over several BGP sessions before it reaches the ingress PE.

> Since the NLRI is only stored by the parent node and may be removed
> by old parent node, such distribution mechanism has no advance for
> such purpose and causes a scaling issue. To reduce the distribution
> symptom, it should explicitly require that, if a node receiving a
> leaf A-D route is not the parent node including old parent node, the
> node should not redistribute the leaf A-D route; in other words, only
> the parent node is allowed to readvertise the leaf A-D route.

This would break the deployment scenarios described above.

> One question, if one ASBR or ABR node that is the parent for a set of
> downstream neighbors fails, what is the procedure for the downstream
> neighbors to select a new parent?

If a node fails, its BGP sessions go down, and the routes it originated are considered to have been withdrawn. If parent and child are separated by a RR, failure of the parent will cause its session to the RR to go down. The RR will then send withdraws to the child for all of the routes that were originated by the parent and redistributed to the child.

If there are two potential parents for a given flow, each will have originated an S-PMSI A-D route with an NLRI identifying the flow. One of these S-PMSI A-D routes is the bestpath for that NLRI, and the next hop (or P2MP Inter-Area Segmented Next Hop Extended Community) of that route determines the parent that is chosen by a given child. If that parent fails, the route from the other potential parent becomes the best path, and the child will then rehome itself by changing the RT on the Leaf A-D route.

> If a child fails, the parent should
> update multicast state as if the child is withdrawn.

If a child fails, its Leaf A-D route appears to the parent to have been withdrawn, and this causes the change in multicast state.

Note that the procedures for dealing with withdrawn S-PMSI or Leaf A-D routes are not specific to Ingress Replication.

Eric

_______________________________________________
BESS mailing list
BESS@ietf.org
https://www.ietf.org/mailman/listinfo/bess

Reply via email to