> -----Original Message-----
> From: Greg KH [mailto:gre...@linuxfoundation.org]
> Sent: Tuesday, July 26, 2016 9:41 PM
> To: KY Srinivasan <k...@microsoft.com>
> Cc: linux-ker...@vger.kernel.org; de...@linuxdriverproject.org; linux-
> r...@vger.kernel.org; yish...@mellanox.com; sean.he...@intel.com;
> dledf...@redhat.com; o...@aepfle.de; a...@canonical.com;
> vkuzn...@redhat.com; jasow...@redhat.com;
> leann.ogasaw...@canonical.com; Long Li <lon...@microsoft.com>
> Subject: Re: [PATCH 1/1] Drivers: infiniband: hw: vmbus-nd: NetworkDirect
> driver for Linux
> 
> On Tue, Jul 26, 2016 at 07:05:37PM -0700, k...@exchange.microsoft.com
> wrote:
> > +/*
> > + * Create a char device that can support read/write for passing
> > + * the payload.
> > + */
> 
> That sounds "interesting"...
> 
> > +
> > +static struct completion ip_event;
> > +static bool opened;
> > +
> > +char hvnd_ip_addr[4];
> > +char hvnd_mac_addr[6];
> > +bool hvnd_addr_set;
> 
> Global variables?

> 
> > +
> > +int hvnd_get_ip_addr(char **ip_addr, char **mac_addr)
> > +{
> > +   int t;
> > +
> > +   /*
> > +    * Now wait for the user level daemon to get us the
> > +    * IP addresses bound to the MAC address.
> > +    */
> > +   if (!hvnd_addr_set) {
> > +           t = wait_for_completion_timeout(&ip_event, 600*HZ);
> > +           if (t == 0)
> > +                   return -ETIMEDOUT;
> > +   }
> > +
> > +   if (hvnd_addr_set) {
> > +           *ip_addr = hvnd_ip_addr;
> > +           *mac_addr = hvnd_mac_addr;
> > +           return 0;
> > +   }
> > +
> > +   return -ENODATA;
> > +}
> > +
> > +static ssize_t hvnd_write(struct file *file, const char __user *buf,
> > +                   size_t count, loff_t *ppos)
> > +{
> > +   char input[120];
> > +   int scaned, i;
> > +   unsigned int mac_addr[6], ip_addr[4];
> > +
> > +   if (hvnd_addr_set) {
> > +           hvnd_error("IP/MAC address already set, ignoring input\n");
> > +           return count;
> > +   }
> > +
> > +   if (count > sizeof(input)-1)
> > +           return -EINVAL;
> > +
> > +   if (copy_from_user(input, buf, count))
> > +           return -EFAULT;
> > +
> > +   input[count] = 0;
> > +
> > +   /*
> > +    * Wakeup the context that may be waiting for this.
> > +    */
> > +   hvnd_debug("get user mode input: %s\n", input);
> > +
> > +   scaned = sscanf(input,
> > +           "rdmaMacAddress=\"%x:%x:%x:%x:%x:%x\"
> rdmaIPv4Address=\"%u.%u.%u.%u\"",
> > +           &mac_addr[0],
> > +           &mac_addr[1],
> > +           &mac_addr[2],
> > +           &mac_addr[3],
> > +           &mac_addr[4],
> > +           &mac_addr[5],
> > +           &ip_addr[0],
> > +           &ip_addr[1],
> > +           &ip_addr[2],
> > +           &ip_addr[3]);
> 
> Oh, that's a mess, you are going to parse text in the kernel that is
> passed on a char device?  Please tell me that not all IB drivers are
> like this...

Greg,

This driver is plugging into the Windows NetworkDirect infrastructure on the 
host side.
The fabric assigns the MAC/IP address for the interface. I have chosen this 
mechanism for
passing the information to the kernel driver. I can certainly look at other 
mechanism.

> 
> > +
> > +   if (scaned == 10) {
> > +
> > +           for (i = 0; i < 6; i++)
> > +                   hvnd_mac_addr[i] = (char) mac_addr[i];
> > +           for (i = 0; i < 4; i++)
> > +                   hvnd_ip_addr[i] = (char) ip_addr[i];
> > +
> > +           hvnd_error("Scanned IP address: %pI4 Mac address: %pM\n",
> > +                      hvnd_ip_addr, hvnd_mac_addr);
> > +
> > +           hvnd_addr_set = true;
> > +           complete(&ip_event);
> > +   }
> > +
> > +   return count;
> > +}
> > +
> > +static int hvnd_open(struct inode *inode, struct file *f)
> > +{
> > +   /*
> > +    * The user level daemon that will open this device is
> > +    * really an extension of this driver. We can have only
> > +    * active open at a time.
> 
> Do you have a pointer to that code?  As it's a logical extension, you
> know what the license for that code better be... :)

This is part of the automation to spin up RDMA capable VMs on Azure.
Linux VMs on Azure include an agent that I used to provision the VMs
(Distro vendors currently ship this agent). Here is the agent code:

https://github.com/Azure/WALinuxAgent/tree/archive/2.0

Currently all the provisioning work is done in the agent code and this includes
provisioning the RDMA NIC - passing the MAC/IP address assigned by the host.
 
> 
> > +    */
> > +   if (opened)
> > +           return -EBUSY;
> 
> You just raced, and lost, oops :(

This is just to catch bugs in the agent code; the only open will be from the
agent.
> 
> There are better ways to do this, the easiest being, why do you need
> "exclusive" access at all?

This case should not happen since we have written the agent code and only that 
code
should inject the provisioning information. 
> 
> > +
> > +   /*
> > +    * The daemon is alive; setup the state.
> > +    */
> > +   opened = true;
> > +   return 0;
> > +}
> > +
> > +static int hvnd_release(struct inode *inode, struct file *f)
> > +{
> > +   /*
> > +    * The daemon has exited; reset the state.
> > +    */
> > +   opened = false;
> > +   return 0;
> > +}
> > +
> > +
> > +static const struct file_operations hvnd_fops = {
> > +   .write          = hvnd_write,
> > +   .release        = hvnd_release,
> > +   .open           = hvnd_open,
> > +};
> > +
> > +static struct miscdevice hvnd_misc = {
> > +   .minor          = MISC_DYNAMIC_MINOR,
> > +   .name           = "hvnd_rdma",
> > +   .fops           = &hvnd_fops,
> > +};
> > +
> > +static int hvnd_dev_init(void)
> > +{
> > +   init_completion(&ip_event);
> > +   return misc_register(&hvnd_misc);
> > +}
> > +
> > +static void hvnd_dev_deinit(void)
> > +{
> > +
> > +   /*
> > +    * The device is going away - perhaps because the
> > +    * host has rescinded the channel. Setup state so that
> > +    * user level daemon can gracefully exit if it is blocked
> > +    * on the read semaphore.
> > +    */
> > +   opened = false;
> 
> But if it's blocked, it's not going to get unblocked here :(

Sorry about the stale comment. We have a couple of Hyper-V daemons that use
a char device to support bi-directional communication between the kernel and 
user land
(the KVP daemon is a good example). When I started this work, the requirements 
here were
very similar - I needed a mechanism to inject some configuration information 
from user-land
into the kernel. So I began with the code I had used elsewhere and made the 
necessary
adjustments. I will cleanup the code and comments.
> 
> 
> > +   /*
> > +    * Signal the semaphore as the device is
> > +    * going away.
> > +    */
> > +   misc_deregister(&hvnd_misc);
> > +}
> 
> Your comment doesn't match the code you are calling.


This will be cleaned up.
> 
> I gave up here, sorry.
> 
> Exactly why do you want a char interface?  It looks like you are using
> it to configure your "hardware", surely there is already other ways to
> do this and not every driver needs to roll-their-own like this?

Well, I have to live within the Windows ecosystem. The Fabric controller 
provides the
provisioning information and that needs to be injected into the kernel. The 
choice I made was
a simple pattern. I can certainly look at implementing a driver specific IOCTL 
that allows the
agent code to write the provisioning information.

Thanks for the feedback; I will fix up the issues you have raised.

Regards,

K. Y


> 
> thanks,
> 
> greg k-h
_______________________________________________
devel mailing list
de...@linuxdriverproject.org
http://driverdev.linuxdriverproject.org/mailman/listinfo/driverdev-devel

Reply via email to