Cong Wang <xiyou.wangc...@gmail.com> wrote:

>syzbot managed to trigger a recursive NETDEV_FEAT_CHANGE event
>between bonding master and slave. I managed to find a reproducer
>for this:
>
>  ip li set bond0 up
>  ifenslave bond0 eth0
>  brctl addbr br0
>  ethtool -K eth0 lro off
>  brctl addif br0 bond0
>  ip li set br0 up

        Presumably this is tied to the LRO feature being special in
netdev_sync_lower_features (via NETIF_F_UPPER_DISABLES), but why doesn't
LRO become disabled and stop the recursion once the test

                if (!(features & feature) && (lower->features & feature)) {

        no longer evalutes to true (in theory)?

        -J

>When a NETDEV_FEAT_CHANGE event is triggered on a bonding slave,
>it captures this and calls bond_compute_features() to fixup its
>master's and other slaves' features. However, when syncing with
>its lower devices by netdev_sync_lower_features() this event is
>triggered again on slaves, so it goes back and forth recursively
>until the kernel stack is exhausted.
>
>It is unnecessary to trigger it for a second time, because when
>we update the features from top down, we rely on each
>dev->netdev_ops->ndo_fix_features() to do the job, each stacked
>device should implement it. NETDEV_FEAT_CHANGE event is necessary
>when we update from bottom up, like in existing stacked device
>implementations.
>
>Just calling __netdev_update_features() is sufficient to fix this
>issue.
>
>Fixes: fd867d51f889 ("net/core: generic support for disabling netdev features 
>down stack")
>Reported-by: syzbot+e73ceacfd8560cc8a...@syzkaller.appspotmail.com
>Reported-by: syzbot+c2fb6f9ddcea95ba4...@syzkaller.appspotmail.com
>Cc: Jarod Wilson <ja...@redhat.com>
>Cc: Josh Poimboeuf <jpoim...@redhat.com>
>Cc: Jay Vosburgh <j.vosbu...@gmail.com>
>Cc: Jann Horn <ja...@google.com>
>Signed-off-by: Cong Wang <xiyou.wangc...@gmail.com>
>---
> net/core/dev.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
>diff --git a/net/core/dev.c b/net/core/dev.c
>index 522288177bbd..ece50ae346c3 100644
>--- a/net/core/dev.c
>+++ b/net/core/dev.c
>@@ -8907,7 +8907,7 @@ static void netdev_sync_lower_features(struct net_device 
>*upper,
>                       netdev_dbg(upper, "Disabling feature %pNF on lower dev 
> %s.\n",
>                                  &feature, lower->name);
>                       lower->wanted_features &= ~feature;
>-                      netdev_update_features(lower);
>+                      __netdev_update_features(lower);
> 
>                       if (unlikely(lower->features & feature))
>                               netdev_WARN(upper, "failed to disable %pNF on 
> %s!\n",
>-- 
>2.26.2
>

---
        -Jay Vosburgh, jay.vosbu...@canonical.com

Reply via email to