> -----Original Message-----
> From: dev [mailto:dev-boun...@dpdk.org] On Behalf Of Zhang, Qi Z
> Sent: Wednesday, July 11, 2018 9:26 AM
> To: Burakov, Anatoly <anatoly.bura...@intel.com>; tho...@monjalon.net
> Cc: Ananyev, Konstantin <konstantin.anan...@intel.com>; dev@dpdk.org;
> Richardson, Bruce <bruce.richard...@intel.com>; Yigit, Ferruh
> <ferruh.yi...@intel.com>; Shelton, Benjamin H
> <benjamin.h.shel...@intel.com>; Vangati, Narender
> <narender.vang...@intel.com>
> Subject: Re: [dpdk-dev] [PATCH v10 05/19] eal: enable hotplug on
> multi-process
> 
> 
> 
> > -----Original Message-----
> > From: Burakov, Anatoly
> > Sent: Tuesday, July 10, 2018 10:01 PM
> > To: Zhang, Qi Z <qi.z.zh...@intel.com>; tho...@monjalon.net
> > Cc: Ananyev, Konstantin <konstantin.anan...@intel.com>; dev@dpdk.org;
> > Richardson, Bruce <bruce.richard...@intel.com>; Yigit, Ferruh
> > <ferruh.yi...@intel.com>; Shelton, Benjamin H
> > <benjamin.h.shel...@intel.com>; Vangati, Narender
> > <narender.vang...@intel.com>
> > Subject: Re: [PATCH v10 05/19] eal: enable hotplug on multi-process
> >
> > On 09-Jul-18 4:36 AM, Qi Zhang wrote:
> > > We are going to introduce the solution to handle hotplug in
> > > multi-process, it includes the below scenario:
> > >
> > > 1. Attach a device from the primary
> > > 2. Detach a device from the primary
> > > 3. Attach a device from a secondary
> > > 4. Detach a device from a secondary
> > >
> > > In the primary-secondary process model, we assume devices are shared
> > > by default. that means attaches or detaches a device on any process
> > > will broadcast to all other processes through mp channel then device
> > > information will be synchronized on all processes.
> > >
> > > Any failure during attaching/detaching process will cause
> > > inconsistent status between processes, so proper rollback action should be
> considered.
> > >
> > > This patch covers the implementation of case 1,2.
> > > Case 3,4 will be implemented on a separate patch.
> > >
> > > IPC scenario for Case 1, 2:
> > >
> > > attach a device
> > > a) primary attach the new device if failed goto h).
> > > b) primary send attach sync request to all secondary.
> > > c) secondary receive request and attach the device and send a reply.
> > > d) primary check the reply if all success goes to i).
> > > e) primary send attach rollback sync request to all secondary.
> > > f) secondary receive the request and detach the device and send a reply.
> > > g) primary receive the reply and detach device as rollback action.
> > > h) attach fail
> > > i) attach success
> > >
> > > detach a device
> > > a) primary send detach sync request to all secondary
> > > b) secondary detach the device and send reply
> > > c) primary check the reply if all success goes to f).
> > > d) primary send detach rollback sync request to all secondary.
> > > e) secondary receive the request and attach back device. goto g)
> > > f) primary detach the device if success goto g), else goto d)
> > > g) detach fail.
> > > h) detach success.
> > >
> > > Signed-off-by: Qi Zhang <qi.z.zh...@intel.com>
> > > ---
> >
> > <snip>
> >
> > > + req.t = EAL_DEV_REQ_TYPE_ATTACH;
> > > + strlcpy(req.busname, busname, RTE_BUS_NAME_MAX_LEN);
> > > + strlcpy(req.devname, devname, RTE_DEV_NAME_MAX_LEN);
> > > + strlcpy(req.devargs, devargs, RTE_DEV_ARGS_MAX_LEN);
> > > +
> > > + if (rte_eal_process_type() != RTE_PROC_PRIMARY)
> > > +         return -ENOTSUP;
> >
> > Nitpick, but maybe do this before strlcpy?
> 
> On the next patch, these strlcpy can be reused when implemented secondary
> process case
> 
> >
> > > +
> > > + /**
> > > +  * attach a device from primary start from here:
> > > +  *
> > > +  * a) primary attach the new device if failed goto h).
> > > +  * b) primary send attach sync request to all secondary.
> > > +  * c) secondary receive request and attach the device and send a reply.
> > > +  * d) primary check the reply if all success goes to i).
> >
> > <snip>
> >
> > > +
> > > + memset(&req, 0, sizeof(req));
> > > + req.t = EAL_DEV_REQ_TYPE_DETACH;
> > > + strlcpy(req.busname, busname, RTE_BUS_NAME_MAX_LEN);
> > > + strlcpy(req.devname, devname, RTE_DEV_NAME_MAX_LEN);
> > > +
> > > + if (rte_eal_process_type() != RTE_PROC_PRIMARY)
> > > +         return -ENOTSUP;
> >
> > Same nitpick, probably move this above.
> >
> > > +
> > > + /**
> > > +  * detach a device from primary start from here:
> > > +  *
> > > +  * a) primary send detach sync request to all secondary
> > > +  * b) secondary detach the device and send reply
> >
> > <snip>
> >
> > > + struct mp_reply_bundle *bundle = param;
> > > + struct rte_mp_msg *msg = &bundle->msg;
> > > + const struct eal_dev_mp_req *req =
> > > +         (const struct eal_dev_mp_req *)msg->param;
> > > + struct rte_mp_msg mp_resp;
> > > + struct eal_dev_mp_req *resp =
> > > +         (struct eal_dev_mp_req *)mp_resp.param;
> > > + int ret = 0;
> > > +
> > > + memset(&mp_resp, 0, sizeof(mp_resp));
> > > +
> > > + switch (req->t) {
> > > + case EAL_DEV_REQ_TYPE_ATTACH:
> > > + case EAL_DEV_REQ_TYPE_DETACH_ROLLBACK:
> > > +         ret = do_dev_hotplug_add(req->busname, req->devname, "");
> >
> > I'm not too familiar with devargs and hotplug, but why are we passing
> > empty devargs string here? Is it possible for it to be not empty?
> 
> For secondary process, devargs is ignored, so we just need the device unique
> identity <busname, devname>
> 
> >
> > > +         break;
> > > + case EAL_DEV_REQ_TYPE_DETACH:
> > > + case EAL_DEV_REQ_TYPE_ATTACH_ROLLBACK:
> > > +         ret = do_dev_hotplug_remove(req->busname, req->devname);
> > > +         break;
> > > + default:
> > > +         ret = -EINVAL;
> > > + }
> >
> > <snip>
> >
> > > + if (rte_eal_process_type() == RTE_PROC_PRIMARY) {
> > > +         ret = rte_mp_action_register(EAL_DEV_MP_ACTION_REQUEST,
> > > +                                 handle_secondary_request);
> > > +         if (ret) {
> > > +                 RTE_LOG(ERR, EAL, "Couldn't register '%s' action\n",
> > > +                         EAL_DEV_MP_ACTION_REQUEST);
> > > +                 return ret;
> > > +         }
> > > + } else {
> > > +         ret = rte_mp_action_register(EAL_DEV_MP_ACTION_REQUEST,
> > > +         handle_primary_request);
> >
> > ^^ wrong indentation.
> 
> Will fix.
> >
> > > +         if (ret) {
> > > +                 RTE_LOG(ERR, EAL, "Couldn't register '%s' action\n",
> > > +                         EAL_DEV_MP_ACTION_REQUEST);
> > > +                 return ret;
> > > +         }
> > > + }
> >
> > <snip>
> >
> > > +
> > > +#endif /* _HOTPLUG_MP_H_ */
> > > diff --git a/lib/librte_eal/common/include/rte_bus.h
> > > b/lib/librte_eal/common/include/rte_bus.h
> > > index eb9eded4e..720f7c3c8 100644
> > > --- a/lib/librte_eal/common/include/rte_bus.h
> > > +++ b/lib/librte_eal/common/include/rte_bus.h
> > > @@ -197,6 +197,9 @@ struct rte_bus_conf {
> > >   typedef enum rte_iova_mode (*rte_bus_get_iommu_class_t)(void);
> > >
> > >
> > > +/* Max length for a bus name */
> > > +#define RTE_BUS_NAME_MAX_LEN 32
> >
> > Is this enforced anywhere in the bus codebase? Can we guarantee that
> > bus name will never be bigger than this?
> 
> I think 32 should be enough for a bus name even in future.

Sorry, I missed your point, I think it is not enforced, we still can add a new 
bus exceed 32, 
but for RTE_DEV_NAME_MAX_LEN which is used in rte_devargs to enforce all device 
name not exceed 64.
So, it's better to move RTE_BUS_NAME_MAX_LEN into hotplug_mp as internal , and 
this can be regarded as a limitation for hotplug so far, though it should be 
enough for all exist cases.
And same for RTE_DEV_ARGS_MAX_LEN.

> 
> >
> > > +
> > >   /**
> > >    * A structure describing a generic bus.
> > >    */
> > > diff --git a/lib/librte_eal/common/include/rte_dev.h
> > > b/lib/librte_eal/common/include/rte_dev.h
> > > index 3879ff3ca..667df20f0 100644
> > > --- a/lib/librte_eal/common/include/rte_dev.h
> > > +++ b/lib/librte_eal/common/include/rte_dev.h
> > > @@ -152,6 +152,9 @@ struct rte_driver {
> > >    */
> > >   #define RTE_DEV_NAME_MAX_LEN 64
> > >
> > > +/* Max devargs length be allowed */ #define RTE_DEV_ARGS_MAX_LEN
> > > +128
> >
> > Same - is this enforced anywhere in the codebase related to devargs?
> 
> I'm not sure, but I guess it is big enough for all exist driver :)
> 
> Thanks
> Qi
> 
> >
> > --
> > Thanks,
> > Anatoly

Reply via email to