On Tue, May 05, 2020 at 02:58:19PM -0700, Cong Wang wrote:
> syzbot managed to trigger a recursive NETDEV_FEAT_CHANGE event
> between bonding master and slave. I managed to find a reproducer
> for this:
> 
>   ip li set bond0 up
>   ifenslave bond0 eth0
>   brctl addbr br0
>   ethtool -K eth0 lro off
>   brctl addif br0 bond0
>   ip li set br0 up
> 
> When a NETDEV_FEAT_CHANGE event is triggered on a bonding slave,
> it captures this and calls bond_compute_features() to fixup its
> master's and other slaves' features. However, when syncing with
> its lower devices by netdev_sync_lower_features() this event is
> triggered again on slaves, so it goes back and forth recursively
> until the kernel stack is exhausted.
> 
> It is unnecessary to trigger it for a second time, because when
> we update the features from top down, we rely on each
> dev->netdev_ops->ndo_fix_features() to do the job, each stacked
> device should implement it. NETDEV_FEAT_CHANGE event is necessary
> when we update from bottom up, like in existing stacked device
> implementations.
> 
> Just calling __netdev_update_features() is sufficient to fix this
> issue.
> 
> Fixes: fd867d51f889 ("net/core: generic support for disabling netdev features 
> down stack")
> Reported-by: syzbot+e73ceacfd8560cc8a...@syzkaller.appspotmail.com
> Reported-by: syzbot+c2fb6f9ddcea95ba4...@syzkaller.appspotmail.com
> Cc: Jarod Wilson <ja...@redhat.com>
> Cc: Josh Poimboeuf <jpoim...@redhat.com>
> Cc: Jay Vosburgh <j.vosbu...@gmail.com>
> Cc: Jann Horn <ja...@google.com>
> Signed-off-by: Cong Wang <xiyou.wangc...@gmail.com>
> ---
>  net/core/dev.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/net/core/dev.c b/net/core/dev.c
> index 522288177bbd..ece50ae346c3 100644
> --- a/net/core/dev.c
> +++ b/net/core/dev.c
> @@ -8907,7 +8907,7 @@ static void netdev_sync_lower_features(struct 
> net_device *upper,
>                       netdev_dbg(upper, "Disabling feature %pNF on lower dev 
> %s.\n",
>                                  &feature, lower->name);
>                       lower->wanted_features &= ~feature;
> -                     netdev_update_features(lower);
> +                     __netdev_update_features(lower);
>  
>                       if (unlikely(lower->features & feature))
>                               netdev_WARN(upper, "failed to disable %pNF on 
> %s!\n",

Wouldn't this mean that when we disable LRO on a bond manually with
"ethtool -K", LRO will be also disabled on its slaves but no netlink
notification for them would be sent to userspace?

Michal

Reply via email to