On Mon, May 02, 2016 at 05:29:37PM +0530, Jerin Jacob wrote:
> prefetch the next packet data address in advance in macswap loop
> for performance improvement.
> 
> Signed-off-by: Jerin Jacob <jerin.jacob at caviumnetworks.com>
> ---
>  app/test-pmd/macswap.c | 3 +++
>  1 file changed, 3 insertions(+)
> 
> diff --git a/app/test-pmd/macswap.c b/app/test-pmd/macswap.c
> index 154889d..c10f4b5 100644
> --- a/app/test-pmd/macswap.c
> +++ b/app/test-pmd/macswap.c
> @@ -113,6 +113,9 @@ pkt_burst_mac_swap(struct fwd_stream *fs)
>       if (txp->tx_ol_flags & TESTPMD_TX_OFFLOAD_INSERT_QINQ)
>               ol_flags |= PKT_TX_QINQ_PKT;
>       for (i = 0; i < nb_rx; i++) {
> +             if (likely(i < nb_rx - 1))
> +                     rte_prefetch0(rte_pktmbuf_mtod(pkts_burst[i + 1],
> +                                                    void *));

At least on IA platforms, there is no issue with prefetching beyond the end of
the array, since it's only a hint to the cpu. If this is true for other 
platforms,
then I suggest we just drop the conditional and just always prefetch.

/Bruce

>               mb = pkts_burst[i];
>               eth_hdr = rte_pktmbuf_mtod(mb, struct ether_hdr *);
>  
> -- 
> 2.1.0
> 

Reply via email to