On Sun, Oct 13, 2019 at 3:22 PM Guillaume Nault <gna...@redhat.com> wrote: > > On Sun, Oct 13, 2019 at 12:09:43PM -0700, Pravin Shelar wrote: > > On Thu, Oct 10, 2019 at 12:07 PM Guillaume Nault <gna...@redhat.com> wrote: > > > > > > In rtnl_net_notifyid(), we certainly can't pass a null GFP flag to > > > rtnl_notify(). A GFP_KERNEL flag would be fine in most circumstances, > > > but there are a few paths calling rtnl_net_notifyid() from atomic > > > context or from RCU critical section. The later also precludes the use > > > of gfp_any() as it wouldn't detect the RCU case. Also, the nlmsg_new() > > > call is wrong too, as it uses GFP_KERNEL unconditionally. > > > > > > Therefore, we need to pass the GFP flags as parameter. The problem then > > > propagates recursively to the callers until the proper flags can be > > > determined. The problematic call chains are: > > > > > > * ovs_vport_cmd_fill_info -> peernet2id_alloc -> rtnl_net_notifyid > > > > > > * rtnl_fill_ifinfo -> rtnl_fill_link_netnsid -> peernet2id_alloc > > > -> rtnl_net_notifyid > > > > > > For openvswitch, ovs_vport_cmd_get() and ovs_vport_cmd_dump() prevent > > > ovs_vport_cmd_fill_info() from using GFP_KERNEL. It'd be nice to move > > > the call out of the RCU critical sections, but struct vport doesn't > > > have a reference counter, so that'd probably require taking the ovs > > > lock. Also, I don't get why ovs_vport_cmd_build_info() used GFP_ATOMIC > > > in nlmsg_new(). I've changed it to GFP_KERNEL for consistency, as this > > > functions seems to be allowed to sleep (as stated in the comment, it's > > > called from a workqueue, under the protection of a mutex). > > > > > It is safe to change GFP flags to GFP_KERNEL in ovs_vport_cmd_build_info(). > > The patch looks good to me. > > > Thanks for your feedback. > > The point of my RFC is to know if it's possible to avoid all these > gfp_t flags, by allowing ovs_vport_cmd_fill_info() to sleep (at least > I'd like to figure out if it's worth spending time investigating this > path). > > To do so, we'd requires moving the ovs_vport_cmd_fill_info() call of > ovs_vport_cmd_{get,dump}() out of RCU critical section. Since we have > no reference counter, I believe we'd have to protect these calls with > ovs_lock() instead of RCU. Is that acceptable? If not, is there any > other way?
I do not see point of added complexity and serialized OVS flow dumps just to avoid GFP_ATOMIC allocations in some code path. What is issue passing the parameter as you have done in this patch?