Hi Joyce, These APIs seem to be written considering the PMD requirements. Is there a need to expose these to applications (external to DPDK?).
> -----Original Message----- > From: Joyce Kong <joyce.k...@arm.com> > Sent: Wednesday, December 18, 2019 12:00 AM > To: tho...@monjalon.net; step...@networkplumber.org; > david.march...@redhat.com; m...@smartsharesystems.com; > jer...@marvell.com; bruce.richard...@intel.com; ravi1.ku...@amd.com; > rm...@marvell.com; shsha...@marvell.com; xuanziya...@huawei.com; > cloud.wangxiao...@huawei.com; zhouguoy...@huawei.com; Honnappa > Nagarahalli <honnappa.nagaraha...@arm.com>; Phil Yang > <phil.y...@arm.com>; Gavin Hu <gavin...@arm.com> > Cc: nd <n...@arm.com>; dev@dpdk.org > Subject: [PATCH v6 1/6] lib/eal: implement the family of rte bit operation > APIs > > There are a lot functions of bit operations scattered and duplicated in PMDs, > consolidating them into a common API family is necessary. Furthermore, > when the bit operation is applied to the IO devices, use __ATOMIC_ACQ_REL > to ensure the ordering for io bit operation. > > Signed-off-by: Joyce Kong <joyce.k...@arm.com> > Reviewed-by: Gavin Hu <gavin...@arm.com> > Reviewed-by: Phil Yang <phil.y...@arm.com> > Acked-by: Morten Brørup <m...@smartsharesystems.com> > --- > MAINTAINERS | 5 + > doc/api/doxy-api-index.md | 5 +- > lib/librte_eal/common/Makefile | 1 + > lib/librte_eal/common/include/rte_bitops.h | 474 > +++++++++++++++++++++++++++++ > lib/librte_eal/common/meson.build | 3 +- > 5 files changed, 485 insertions(+), 3 deletions(-) create mode 100644 > lib/librte_eal/common/include/rte_bitops.h > > diff --git a/MAINTAINERS b/MAINTAINERS > index 4395d8d..d2a29a2 100644 > --- a/MAINTAINERS > +++ b/MAINTAINERS > @@ -236,6 +236,11 @@ M: Cristian Dumitrescu > <cristian.dumitre...@intel.com> > F: lib/librte_eal/common/include/rte_bitmap.h > F: app/test/test_bitmap.c > > +Bitops > +M: Joyce Kong <joyce.k...@arm.com> > +F: lib/librte_eal/common/include/rte_bitops.h > +F: app/test/test_bitops.c > + > MCSlock - EXPERIMENTAL > M: Phil Yang <phil.y...@arm.com> > F: lib/librte_eal/common/include/generic/rte_mcslock.h > diff --git a/doc/api/doxy-api-index.md b/doc/api/doxy-api-index.md index > dff496b..ade7c01 100644 > --- a/doc/api/doxy-api-index.md > +++ b/doc/api/doxy-api-index.md > @@ -133,12 +133,13 @@ The public API headers are grouped by topics: > [BPF] (@ref rte_bpf.h) > > - **containers**: > + [bitmap] (@ref rte_bitmap.h), > + [bitops] (@ref rte_bitops.h), > [mbuf] (@ref rte_mbuf.h), > [mbuf pool ops] (@ref rte_mbuf_pool_ops.h), > [ring] (@ref rte_ring.h), > [stack] (@ref rte_stack.h), > - [tailq] (@ref rte_tailq.h), > - [bitmap] (@ref rte_bitmap.h) > + [tailq] (@ref rte_tailq.h) > > - **packet framework**: > * [port] (@ref rte_port.h): > diff --git a/lib/librte_eal/common/Makefile > b/lib/librte_eal/common/Makefile index c2c6d92..dd025c1 100644 > --- a/lib/librte_eal/common/Makefile > +++ b/lib/librte_eal/common/Makefile > @@ -19,6 +19,7 @@ INC += rte_malloc.h rte_keepalive.h rte_time.h INC += > rte_service.h rte_service_component.h INC += rte_bitmap.h rte_vfio.h > rte_hypervisor.h rte_test.h INC += rte_reciprocal.h rte_fbarray.h rte_uuid.h > +INC += rte_bitops.h > > GENERIC_INC := rte_atomic.h rte_byteorder.h rte_cycles.h rte_prefetch.h > GENERIC_INC += rte_memcpy.h rte_cpuflags.h diff --git > a/lib/librte_eal/common/include/rte_bitops.h > b/lib/librte_eal/common/include/rte_bitops.h > new file mode 100644 > index 0000000..34158d1 > --- /dev/null > +++ b/lib/librte_eal/common/include/rte_bitops.h > @@ -0,0 +1,474 @@ > +/* SPDX-License-Identifier: BSD-3-Clause > + * Copyright(c) 2019 Arm Limited > + */ > + > +#ifndef _RTE_BITOPS_H_ > +#define _RTE_BITOPS_H_ > + > +/** > + * @file > + * Bit Operations > + * > + * This file defines a API for bit operations without/with memory ordering. > + */ > + > +#include <stdint.h> > +#include <rte_debug.h> > +#include <rte_compat.h> > + > +/*---------------------------- 32 bit operations > +----------------------------*/ > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Get the target bit from a 32-bit value without memory ordering. > + * > + * @param nr > + * The target bit to get. > + * @param addr > + * The address holding the bit. > + * @return > + * The target bit. > + */ > +__rte_experimental > +static inline uint32_t > +rte_get_bit32_relaxed(unsigned int nr, uint32_t *addr) { Why not pass the memory order as a parameter? It would reduce the number of API calls by half. > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + return __atomic_load_n(addr, __ATOMIC_RELAXED) & mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Set the target bit in a 32-bit value to 1 without memory ordering. > + * > + * @param nr > + * The target bit to set. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_set_bit32_relaxed(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + __atomic_fetch_or(addr, mask, __ATOMIC_RELAXED); } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Clear the target bit in a 32-bit value to 0 without memory ordering. > + * > + * @param nr > + * The target bit to clear. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_clear_bit32_relaxed(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + __atomic_fetch_and(addr, ~mask, __ATOMIC_RELAXED); } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 32-bit value, then set it to 1 > +without > + * memory ordering. > + * > + * @param nr > + * The target bit to get and set. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint32_t > +rte_test_and_set_bit32_relaxed(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + return __atomic_fetch_or(addr, mask, __ATOMIC_RELAXED) & mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 32-bit value, then clear it to 0 > +without > + * memory ordering. > + * > + * @param nr > + * The target bit to get and clear. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint32_t > +rte_test_and_clear_bit32_relaxed(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + return __atomic_fetch_and(addr, ~mask, __ATOMIC_RELAXED) & > mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Get the target bit from a 32-bit value with memory ordering. > + * > + * @param nr > + * The target bit to get. > + * @param addr > + * The address holding the bit. > + * @return > + * The target bit. > + */ > +__rte_experimental > +static inline uint32_t > +rte_get_bit32(unsigned int nr, uint32_t *addr) { __atomic_load_n takes other memory orders along with relaxed and acquire. The API name needs to change to indicate acquire memory order here? > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + return __atomic_load_n(addr, __ATOMIC_ACQUIRE) & mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Set the target bit in a 32-bit value to 1 with memory ordering. > + * > + * @param nr > + * The target bit to set. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_set_bit32(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + __atomic_fetch_or(addr, mask, __ATOMIC_ACQ_REL); } Why not use just '__ATOMIC_RELEASE' here? The full barrier might not be required in all use cases. > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Clear the target bit in a 32-bit value to 0 with memory ordering. > + * > + * @param nr > + * The target bit to clear. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_clear_bit32(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + __atomic_fetch_and(addr, ~mask, __ATOMIC_ACQ_REL); } Why not use just '__ATOMIC_RELEASE' here? The full barrier might not be required in all use cases. I see similar issue in other APIs below. > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 32-bit value, then set it to 1 with > + * memory ordering. > + * > + * @param nr > + * The target bit to get and set. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint32_t > +rte_test_and_set_bit32(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + return __atomic_fetch_or(addr, mask, __ATOMIC_ACQ_REL) & > mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 32-bit value, then clear it to 0 with > + * memory ordering. > + * > + * @param nr > + * The target bit to get and clear. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint32_t > +rte_test_and_clear_bit32(unsigned int nr, uint32_t *addr) { > + RTE_ASSERT(nr < 32); > + > + uint32_t mask = UINT32_C(1) << nr; > + return __atomic_fetch_and(addr, ~mask, __ATOMIC_ACQ_REL) & > mask; } > + > +/*---------------------------- 64 bit operations > +----------------------------*/ > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Get the target bit from a 64-bit value without memory ordering. > + * > + * @param nr > + * The target bit to get. > + * @param addr > + * The address holding the bit. > + * @return > + * The target bit. > + */ > +__rte_experimental > +static inline uint64_t > +rte_get_bit64_relaxed(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + return __atomic_load_n(addr, __ATOMIC_RELAXED) & mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Set the target bit in a 64-bit value to 1 without memory ordering. > + * > + * @param nr > + * The target bit to set. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_set_bit64_relaxed(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + __atomic_fetch_or(addr, mask, __ATOMIC_RELAXED); } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Clear the target bit in a 64-bit value to 0 without memory ordering. > + * > + * @param nr > + * The target bit to clear. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_clear_bit64_relaxed(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + __atomic_fetch_and(addr, ~mask, __ATOMIC_RELAXED); } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 64-bit value, then set it to 1 > +without > + * memory ordering. > + * > + * @param nr > + * The target bit to get and set. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint64_t > +rte_test_and_set_bit64_relaxed(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + return __atomic_fetch_or(addr, mask, __ATOMIC_RELAXED) & mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 64-bit value, then clear it to 0 > +without > + * memory ordering. > + * > + * @param nr > + * The target bit to get and clear. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint64_t > +rte_test_and_clear_bit64_relaxed(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + return __atomic_fetch_and(addr, ~mask, __ATOMIC_RELAXED) & > mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Get the target bit from a 64-bit value with memory ordering. > + * > + * @param nr > + * The target bit to get. > + * @param addr > + * The address holding the bit. > + * @return > + * The target bit. > + */ > +__rte_experimental > +static inline uint64_t > +rte_get_bit64(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + return __atomic_load_n(addr, __ATOMIC_ACQUIRE) & mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Set the target bit in a 64-bit value to 1 with memory ordering. > + * > + * @param nr > + * The target bit to set. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_set_bit64(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + __atomic_fetch_or(addr, mask, __ATOMIC_ACQ_REL); } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Clear the target bit in a 64-bit value to 0 with memory ordering. > + * > + * @param nr > + * The target bit to clear. > + * @param addr > + * The address holding the bit. > + */ > +__rte_experimental > +static inline void > +rte_clear_bit64(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + __atomic_fetch_and(addr, ~mask, __ATOMIC_ACQ_REL); } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 64-bit value, then set it to 1 with > + * memory ordering. > + * > + * @param nr > + * The target bit to get and set. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint64_t > +rte_test_and_set_bit64(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + return __atomic_fetch_or(addr, mask, __ATOMIC_ACQ_REL) & > mask; } > + > +/** > + * @warning > + * @b EXPERIMENTAL: this API may change, or be removed, without prior > +notice > + * > + * Return the original bit from a 64-bit value, then clear it to 0 with > + * memory ordering. > + * > + * @param nr > + * The target bit to get and clear. > + * @param addr > + * The address holding the bit. > + * @return > + * The original bit. > + */ > +__rte_experimental > +static inline uint64_t > +rte_test_and_clear_bit64(unsigned int nr, uint64_t *addr) { > + RTE_ASSERT(nr < 64); > + > + uint64_t mask = UINT64_C(1) << nr; > + return __atomic_fetch_and(addr, ~mask, __ATOMIC_ACQ_REL) & > mask; } > +#endif /* _RTE_BITOPS_H_ */ > diff --git a/lib/librte_eal/common/meson.build > b/lib/librte_eal/common/meson.build > index 2b97715..766edbd 100644 > --- a/lib/librte_eal/common/meson.build > +++ b/lib/librte_eal/common/meson.build > @@ -50,9 +50,10 @@ common_objs += eal_common_arch_objs > > common_headers = files( > 'include/rte_alarm.h', > + 'include/rte_bitmap.h', > + 'include/rte_bitops.h', > 'include/rte_branch_prediction.h', > 'include/rte_bus.h', > - 'include/rte_bitmap.h', > 'include/rte_class.h', > 'include/rte_common.h', > 'include/rte_compat.h', > -- > 2.7.4