Albert – Do not confuse the current lack of detection with when the problem gets introduced.
The fact that the problem is not detected on protocol adjacency formation does not mean the problem gets introduced afterwards. Unless you are saying that folks change the link MTU AFTER the link comes up and has been used for a while the problem exists as soon as the link comes up. You could therefore detect this in a number of ways: 1)Use IS-IS ☺ 2)Enhance other routing protocols to do what IS-IS does 3)Send BFD large packets (Echo or async) 4)Potentially some other OAM mechanism In regards to #3, unless you believe link MTUs will change “on the fly”, sending the large packets during initial session bringup would be sufficient. If folks are concerned (as I am) that sending large BFD packets all the time introduces some risks for scalability/stability of BFD sessions, then one strategy would be to send the large packets only on session bringup. FYI, some implementations have knobs on IS-IS to do exactly this i.e., send padded hellos until the adjacency is formed – then revert to small hellos. In the case of BFD I think there is a more compelling reason to be conservative in how often you send large packets given where it is implemented and how often the BFD packets are sent. Les From: Albert Fu (BLOOMBERG/ 120 PARK) <af...@bloomberg.net> Sent: Monday, October 29, 2018 10:39 AM To: Les Ginsberg (ginsberg) <ginsb...@cisco.com>; jh...@juniper.net; rtg-bfd@ietf.org Subject: Re: BFD WG adoption for draft-haas-bfd-large-packets Hi Les, > Jeff/Albert - > > Given the MTU issue is associated with a link coming up - and the use of Echo > would allow the problem to be detected and prevent the BFD session from > coming up - > and you are acknowledging that the protocol allows padded Echo packets today > ... > > is there really a need to do anything more? > > Les > Actually, all the issues we have observed were not associated with link going up. The MTU issues occurred after OSPF/BGP had established adjacency without any events on the routers. ospf/BGP hellos/keepalives continued to be transmitted fine (small packet size), but applications sending max size packets over the link would time out and fail. Hence, I mentioned several times that this issue is rather time consuming to troubleshoot, as the cause is with the Telco network and outside of our control and we do not see any alarms. I did also look at BFD echo mode. As Jeff indicated, this is not widely deployed (among the vendors we use, only one supports it). Thanks Albert