Re: [dpdk-dev] [PATCH v3] hash table: add an iterator over conflicting entries

Michel Machado Fri, 07 Sep 2018 04:28:44 -0700

On 09/05/2018 04:27 PM, Wang, Yipeng1 wrote:

Hmm I see, it falls back to my original thought to have malloc inside the init 
function..
Thanks for the explanation. :)


So I guess with your implementation, in future if we change the internal state 
to be larger,
the ABI will be broken.

If that happens, yes, the ABI would need to change again. But thisconcern is overblown for two reasons. First, this event is unlikely tohappen because struct rte_hash_iterator_state is already allocating 64bytes while struct rte_hash_iterator_istate and structrte_hash_iterator_conflict_entries_istate consume 16 and 20 bytes,respectively. Thus, the complexity of the underlying hash algorithmwould need to grow substantially to force the necessary state of theseiterators to grow more than 4x and 3x, respectively. This is unlikely tohappen, and, if it does, it would likely break the ABI somewhere elseand have a high impact on applications anyway.

Second, even if the unlikely event happens, all one would need to dois to increase the size of struct rte_hash_iterator_state, mark the newAPI as a new version, and applications would be ready for the new ABIjust recompiling.

BTW, this patch set also changes API so proper notice is needed.
People more familiar with API/ABI change policies may be able to help here.


   We'd be happy to get feedback on this aspect.

Just to confirm, is there anyway like I said for your application to have some 
long-live states
and reuse them throughout the application so that you don’t have to have 
short-lived ones in stack?

Two things would need to happen for this to be possible. The initfunctions would need to accept previously allocated iterator states,that is, the init function would act as a reset of the state when actingon a previous allocated state. And, applications would now need to carrythese pre-allocated state to avoid a malloc. In order words, we'llincrease the complexity of the API.

To emphasize that the cost of a malloc is not negligible,rte_malloc() needs to get a spinlock (see heap_alloc_on_socket()), doits thing to allocate memory, and, if the first attempt fails, try toallocate the memory on other sockets (see end of malloc_heap_alloc()).For an iterator that goes through the whole hash table, this cost may beokay, but for an iterator that goes through a couple entries, this costis a lot to add.

This memory allocation concern is not new. Functionrte_pktmbuf_read(), for example, let applications pass buffers, whichare often allocated in the execution stack, to avoid the malloc cost.


[ ]'s
Michel Machado

Re: [dpdk-dev] [PATCH v3] hash table: add an iterator over conflicting entries

Reply via email to