On 1/10/19 5:22 AM, Hyong Youb Kim wrote:
For bonding modes like broadcast that use bond_ethdev_rx_burst(), it
is fairly easy to produce a crash simply by bringing a slave port's
link down. When slave links go down, the driver on one thread reduces
active_slave_count via the LSC callback and deactivate_slave(). At the
same time, bond_ethdev_rx_burst() running on a forwarding thread may
increment active_slave (next active slave) beyond
active_slave_count. Here is a typical sequence of events.

At time 0:
active_slave_count = 3
active_slave = 2

At time 1:
A slave link goes down.
Thread 0 (main) reduces active_slave_count to 2.

At time 2:
Thread 1 (forwarding) executes bond_ethdev_rx_burst().
- Reads active_slave_count = 2.
- Increments active_slave at the end to 3.

From this point on, every time bond_ethdev_rx_burst() runs,
active_slave increments by one, eventually going well out of bounds of
the active_slaves array and causing a crash.

Make the rx burst function first check that active_slave is within
bounds. If not, reset it to 0 to avoid out-of-range array access.
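
The sequence above can be replayed in a small single-threaded model. This
is only a sketch of the index arithmetic, not the driver code: struct
bond_model, rx_burst_model() and replay() are hypothetical names, and the
equality-based wrap test is assumed to match what the rx burst function
does with active_slave. No array is actually read; the model just records
the largest index that would have been used.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for the bonding private data; not the DPDK struct. */
struct bond_model {
	uint16_t active_slave_count;
	uint16_t active_slave;	/* next slave index to poll */
};

/*
 * Models only the index arithmetic of one rx burst call (assumed shape).
 * Returns the largest index the call would use to read active_slaves[].
 */
static uint16_t
rx_burst_model(struct bond_model *m, int guarded)
{
	uint16_t slave_count = m->active_slave_count;
	uint16_t active_slave = m->active_slave;
	uint16_t max_idx = 0;
	uint16_t i;

	/* The check added by this patch. */
	if (guarded && active_slave >= slave_count) {
		m->active_slave = 0;
		active_slave = 0;
	}

	for (i = 0; i < slave_count; i++) {
		if (guarded)
			assert(active_slave < slave_count);
		if (active_slave > max_idx)
			max_idx = active_slave;
		/* Equality test: an index already past the count never wraps. */
		if (++active_slave == slave_count)
			active_slave = 0;
	}

	if (++m->active_slave == slave_count)
		m->active_slave = 0;
	return max_idx;
}

/*
 * Replays the timeline: 3 active slaves, next index 2, then one link
 * goes down before the forwarding thread polls 'calls' times.
 * Returns the final active_slave; stores the largest index used.
 */
static uint16_t
replay(int guarded, int calls, uint16_t *max_idx_out)
{
	struct bond_model m = { .active_slave_count = 3, .active_slave = 2 };
	uint16_t max_idx = 0;
	int n;

	m.active_slave_count = 2;	/* time 1: LSC callback shrinks the count */
	for (n = 0; n < calls; n++) {
		uint16_t idx = rx_burst_model(&m, guarded);

		if (idx > max_idx)
			max_idx = idx;
	}
	*max_idx_out = max_idx;
	return m.active_slave;
}
```

In this model, replay(0, 5, &max) leaves active_slave at 7 with the same
maximum index, growing by one per call, while replay(1, 5, &max) never
uses an index above 1.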

Fixes: e1110e977648 ("net/bonding: fix Rx slave fairness")
Cc: sta...@dpdk.org

Signed-off-by: Hyong Youb Kim <hyon...@cisco.com>

Acked-by: Chas Williams <ch...@att.com>

---
  drivers/net/bonding/rte_eth_bond_pmd.c | 14 ++++++++++++++
  1 file changed, 14 insertions(+)

diff --git a/drivers/net/bonding/rte_eth_bond_pmd.c b/drivers/net/bonding/rte_eth_bond_pmd.c
index daf2440cd..bc2405e54 100644
--- a/drivers/net/bonding/rte_eth_bond_pmd.c
+++ b/drivers/net/bonding/rte_eth_bond_pmd.c
@@ -68,6 +68,15 @@ bond_ethdev_rx_burst(void *queue, struct rte_mbuf **bufs, uint16_t nb_pkts)
        internals = bd_rx_q->dev_private;
        slave_count = internals->active_slave_count;
        active_slave = internals->active_slave;
+       /*
+        * Reset the active slave index, in case active_slave goes out
+        * of bound.  It can happen when slave links go down, and
+        * another thread (LSC callback) shrinks the slave count.
+        */
+       if (active_slave >= slave_count) {
+               internals->active_slave = 0;
+               active_slave = 0;
+       }
        for (i = 0; i < slave_count && nb_pkts; i++) {
                uint16_t num_rx_slave;
@@ -273,6 +282,11 @@ bond_ethdev_rx_burst_8023ad_fast_queue(void *queue, struct rte_mbuf **bufs,
        active_slave = internals->active_slave;
        memcpy(slaves, internals->active_slaves,
                        sizeof(internals->active_slaves[0]) * slave_count);
+       /* active_slave may go out of bound. See bond_ethdev_rx_burst() */
+       if (active_slave >= slave_count) {
+               internals->active_slave = 0;
+               active_slave = 0;
+       }
        for (i = 0; i < slave_count && nb_pkts; i++) {
                uint16_t num_rx_slave;
