From: Ido Schimmel <ido...@mellanox.com> Currently, when non-default (custom) FIB rules are used, devices capable of layer 3 offloading flush their tables and let the kernel do the forwarding instead.
When these devices' drivers are loaded they register to the FIB notification chain, which lets them know about the existence of any custom FIB rules. This is done by sending a RULE_ADD notification based on the value of 'net->ipv4.fib_has_custom_rules'. This approach is problematic when VRF offload is taken into account, as upon the creation of the first VRF netdev, a l3mdev rule is programmed to direct skbs to the VRF's table. Instead of merely reading the above value and sending a single RULE_ADD notification, we should iterate over all the FIB rules and send a detailed notification for each, thereby allowing offloading drivers to sanitize the rules they don't support and potentially flush their tables. While l3mdev rules are uniquely marked, the default rules are not. Therefore, when they are being notified they might invoke offloading drivers to unnecessarily flush their tables. Solve this by uniquely marking the default rules. Add a 'def' member to the FIB rule struct and set it whenever a default rule is created. Signed-off-by: Ido Schimmel <ido...@mellanox.com> Signed-off-by: Jiri Pirko <j...@mellanox.com> --- include/net/fib_rules.h | 3 ++- net/core/fib_rules.c | 1 + 2 files changed, 3 insertions(+), 1 deletion(-) diff --git a/include/net/fib_rules.h b/include/net/fib_rules.h index 8dbfdf7..8103f99 100644 --- a/include/net/fib_rules.h +++ b/include/net/fib_rules.h @@ -23,7 +23,8 @@ struct fib_rule { u32 table; u8 action; u8 l3mdev; - /* 2 bytes hole, try to use */ + u8 def; + /* 1 byte hole, try to use */ u32 target; __be64 tun_id; struct fib_rule __rcu *ctarget; diff --git a/net/core/fib_rules.c b/net/core/fib_rules.c index b6791d9..fa92c71 100644 --- a/net/core/fib_rules.c +++ b/net/core/fib_rules.c @@ -39,6 +39,7 @@ int fib_default_rule_add(struct fib_rules_ops *ops, r->flags = flags; r->fr_net = ops->fro_net; r->uid_range = fib_kuid_range_unset; + r->def = 1; r->suppress_prefixlen = -1; r->suppress_ifgroup = -1; -- 2.7.4