HI,

We are using OVS with the DPDK datapath and leveraging the OVS hardware offload 
feature with Marvell CN10K accelerators to achieve high performance.

However, we are encountering a system crash whenever a non-PMD thread (carrying 
an invalid rte_lcore_id) attempts to transmit a packet, in scenarios like
ARP resolution and managing bridge traffic, non-PMD threads (e.g., the main 
thread) flush some packets out using dp_netdev_pmd_flush_output_packets().

CN10K PMD TX routine requires each worker thread to write packets in its own 
dedicated region and hit the corresponding doorbell to ensure synchronization.
To avoid conditional checks in the TX fast path, base address of these regions 
is calculated based on lcore_id.
When a thread with an invalid lcore_id attempts to transmit, it calculates an 
incorrect base address and triggers the doorbell, resulting in a crash when the 
hardware
accesses this invalid address.

To mitigate this crash we came up with a workaround where a non-pmd thread 
registers for a valid lcore_id (using rte_thread_register()) before 
transmission 
and deregister back (rte_thread_unregister()) on completion, as shown in the 
following patch:

        diff --git a/lib/dpif-netdev.c b/lib/dpif-netdev.c
        index 99ff9b369..6edc5eb29 100644
        --- a/lib/dpif-netdev.c
        +++ b/lib/dpif-netdev.c
        @@ -5599,6 +5599,7 @@ dp_netdev_pmd_flush_output_packets(struct 
dp_netdev_pmd_thread *pmd,
         {
                 struct tx_port *p;
                 int output_cnt = 0;
        +    bool dpdk_attached;

                 if (!pmd->n_output_batches) {
                         return 0;
        @@ -5607,7 +5608,11 @@ dp_netdev_pmd_flush_output_packets(struct 
dp_netdev_pmd_thread *pmd,
                 HMAP_FOR_EACH (p, node, &pmd->send_port_cache) {
                         if (!dp_packet_batch_is_empty(&p->output_pkts)
                                 && (force || pmd->ctx.now >= p->flush_time)) {
        +            if (pmd->core_id == NON_PMD_CORE_ID)
        +                  dpdk_attached = dpdk_attach_thread(0);
                                 output_cnt += 
dp_netdev_pmd_flush_output_on_port(pmd, p);
        +            if (dpdk_attached && pmd->core_id == NON_PMD_CORE_ID)
        +                  dpdk_detach_thread();
                         }
                 }
                 return output_cnt;
             

Is the above change acceptable in OVS? Could you please suggest a solution for 
our problem? We strictly want to avoid any condition checks in our TX fast path 
routine.

Thanks
Harman
_______________________________________________
discuss mailing list
disc...@openvswitch.org
https://mail.openvswitch.org/mailman/listinfo/ovs-discuss

Reply via email to