On Thu, Sep 6, 2018 at 4:14 AM Vlad Buslov <vla...@mellanox.com> wrote: > > Isn't a concurrent tcf_idr_check_alloc() able to livelock here with > > your change? > > > > idr_for_each_entry_ul{ > > spin_lock(&idrinfo->lock); > > idr_remove(); > > spin_unlock(&idrinfo->lock); > > // tcf_idr_check_alloc() jumps in, > > // allocates next ID which can be found > > // by idr_get_next_ul() > > } // the whole loop goes _literately_ infinite... > > idr_for_each_entry_ul traverses idr entries with ascending order of > identifiers, so infinite livelock like this is not possible because it > never goes back to newly added entries with id<current_id.
I said "literately infinite", it could go from 1 to UINT_MAX, sufficient to prove my point of livelock. > > > > Also, idr_for_each_entry_ul() is supposed to be protected either > > by RCU or idrinfo->lock, no? With your change or without any change, > > it doesn't even have any lock after removing RTNL? > > After reading this comment I checked actual idr implementation and I > think you are right. Even though idr_for_each_entry_ul() macro (and > function idr_get_next_ul() that it uses to iterate over idr entries) > doesn't specify any locking requirements in comment description (that is > why this patch doesn't use any), its implementation seems to require > external synchronization. Yeah, it is also a reader, so either a reader lock like RCU or a writer lock like idrinfo->lock. > > You suggest I should just hold idrinfo->lock for whole del_walker loop > duration, or play nicely with potential concurrent users and > take/release it per action? My suggestion is pretty clear, you just missed it, let me copy-n-paste: With what I suggest: spin_lock(&idrinfo->lock); idr_for_each_entry_ul{ idr_remove(); } spin_unlock(&idrinfo->lock);