On Fri, May 26, 2023 at 06:25:46PM +0300, Vitaliy Makkoveev wrote: > On Fri, May 26, 2023 at 05:08:06PM +0200, Alexander Bluhm wrote: > > On Fri, May 26, 2023 at 05:29:58PM +0300, Vitaliy Makkoveev wrote: > > > On Wed, May 17, 2023 at 01:02:58PM +0300, Vitaliy Makkoveev wrote: > > > > mrt_rtwalk_mfcsysctl() performs read-only access to protected data, so > > > > rtable_walk() could be called with shared netlock. > > > > > > Regardless on sysctl(2) unlocking backout, the netlock around > > > mrt_sysctl_mfc() could be relaxed to shared netlock. > > > > IP multicast is far from MP ready. As a usual workaround I use > > exclusive netlock or shared netlock plus kernel lock. > > > > Whenever I have to call something with multicast from a section > > that has only shared netlock, I grab the kenrel lock. > > > > So using only NET_LOCK_SHARED() for reading multicast data does not > > seem enough. > > mrt_rtwalk_mfcsysctl() does read-only access. Since the sysctl(2) > unlocking was reverted, this will be "shared netlock plus kernel lock" > case. > > > Look at the way ip_input_if() calls ip_mforward(). Maybe we should > > start making ip_mroute.c MP safe. Unfortunately I have no test > > environment for that. > > mpi@ said the kernel lock removal from uvm_swap_free() will be easy. So > I want to try to remove it and push sysctl(2) unlocking back.
But you cannot do both. Move to shared lock in mrt_rtwalk_mfcsysctl, and remove kernel lock from sysctl. Write access is done with shared netlock plus kernel lock. ip_input_if() -> ip_mforward() -> mfc_add() -> update_mfc_params() -> mrt_mcast_del() -> rt->rt_llinfo = NULL; So shared netlock alone is not sufficient for read access. The popper way is to add some locking to mroute to protect itself when running with shared netlock. bluhm > > > > Index: sys/netinet/ip_input.c > > > > =================================================================== > > > > RCS file: /cvs/src/sys/netinet/ip_input.c,v > > > > retrieving revision 1.384 > > > > diff -u -p -r1.384 ip_input.c > > > > --- sys/netinet/ip_input.c 16 May 2023 19:36:00 -0000 1.384 > > > > +++ sys/netinet/ip_input.c 17 May 2023 09:59:16 -0000 > > > > @@ -1712,10 +1712,7 @@ ip_sysctl(int *name, u_int namelen, void > > > > case IPCTL_MRTMFC: > > > > if (newp) > > > > return (EPERM); > > > > - NET_LOCK(); > > > > - error = mrt_sysctl_mfc(oldp, oldlenp); > > > > - NET_UNLOCK(); > > > > - return (error); > > > > + return (mrt_sysctl_mfc(oldp, oldlenp)); > > > > case IPCTL_MRTVIF: > > > > if (newp) > > > > return (EPERM); > > > > Index: sys/netinet/ip_mroute.c > > > > =================================================================== > > > > RCS file: /cvs/src/sys/netinet/ip_mroute.c,v > > > > retrieving revision 1.138 > > > > diff -u -p -r1.138 ip_mroute.c > > > > --- sys/netinet/ip_mroute.c 19 Apr 2023 20:03:51 -0000 1.138 > > > > +++ sys/netinet/ip_mroute.c 17 May 2023 09:59:16 -0000 > > > > @@ -479,10 +479,12 @@ mrt_sysctl_mfc(void *oldp, size_t *oldle > > > > msa.msa_len = *oldlenp; > > > > msa.msa_needed = 0; > > > > > > > > + NET_LOCK_SHARED(); > > > > for (rtableid = 0; rtableid <= RT_TABLEID_MAX; rtableid++) { > > > > rtable_walk(rtableid, AF_INET, NULL, > > > > mrt_rtwalk_mfcsysctl, > > > > &msa); > > > > } > > > > + NET_UNLOCK_SHARED(); > > > > > > > > if (msa.msa_minfos != NULL && msa.msa_needed > 0 && > > > > (error = copyout(msa.msa_minfos, oldp, msa.msa_needed)) != > > > > 0) { > > > > > >