21/03/2024 10:18, Ma, WenwuX: > From: Thomas Monjalon <tho...@monjalon.net> > > 21/03/2024 02:25, Ma, WenwuX: > > > From: Thomas Monjalon <tho...@monjalon.net> > > > > 20/03/2024 08:23, Wenwu Ma: > > > > > The structure rte_dma_dev needs to be aligned to the cache line, > > > > > but the return value of malloc may not be aligned to the cache > > > > > line. When we use memset to clear the rte_dma_dev object, it may > > > > > cause a segmentation fault in clang-x86-platform. > > > > > > > > > > This is because clang uses the "vmovaps" assembly instruction for > > > > > memset, which requires that the operands (rte_dma_dev objects) > > > > > must aligned on a 16-byte boundary or a general-protection > > > > > exception (#GP) is generated. > > > > > > > > > > Therefore, either additional memory is applied for re-alignment, > > > > > or the rte_dma_dev object does not require cache line alignment. > > > > > The patch chooses the former option to fix the issue. > > > > > > > > > > Fixes: b36970f2e13e ("dmadev: introduce DMA device library") > > > > > Cc: sta...@dpdk.org > > > > > > > > > > Signed-off-by: Wenwu Ma <wenwux...@intel.com> > > > > [..] > > > > > - size = dma_devices_max * sizeof(struct rte_dma_dev); > > > > > - rte_dma_devices = malloc(size); > > > > > - if (rte_dma_devices == NULL) > > > > > + /* The dma device object is expected to align cacheline, but > > > > > + * the return value of malloc may not be aligned to the cache > > > > > line. > > > > > + * Therefore, extra memory is applied for realignment. > > > > > + * note: We do not call posix_memalign/aligned_alloc because it > > > > > is > > > > > + * version dependent on libc. > > > > > + */ > > > > > + size = dma_devices_max * sizeof(struct rte_dma_dev) + > > > > > + RTE_CACHE_LINE_SIZE; > > > > > + ptr = malloc(size); > > > > > + if (ptr == NULL) > > > > > return -ENOMEM; > > > > > - memset(rte_dma_devices, 0, size); > > > > > + memset(ptr, 0, size); > > > > > + > > > > > + rte_dma_devices = RTE_PTR_ALIGN(ptr, RTE_CACHE_LINE_SIZE); > > > > > > > > Why not using aligned_alloc()? > > > > https://en.cppreference.com/w/c/memory/aligned_alloc > > > > > > > > > > > because it is version dependent on libc. > > > > Which libc is required? > > > > using the 'man aligned_alloc' command, we has the following description: > > VERSIONS > The functions memalign(), valloc(), and pvalloc() have been available > in all Linux libc libraries. > > The function aligned_alloc() was added to glibc in version 2.16.
released in 2012-06-30 > The function posix_memalign() is available since glibc 2.1.91. I think we could bump our libc requirements for these functions. I understand there is also a concern on Windows, but an alternative exists there. We may need a wrapper like "rte_alloc_align".