On Thu, Aug 31, 2023 at 03:34:37PM +1000, Michael Ellerman wrote: > Michal Suchánek <msucha...@suse.de> writes: > > Hello, > > > > thanks for working on this. > > > > On Tue, Aug 22, 2023 at 04:33:39PM -0500, Nathan Lynch via B4 Relay wrote: > >> From: Nathan Lynch <nath...@linux.ibm.com> > >> > >> PowerVM LPARs may retrieve Vital Product Data (VPD) for system > >> components using the ibm,get-vpd RTAS function. > >> > >> We can expose this to user space with a /dev/papr-vpd character > >> device, where the programming model is: > >> > >> struct papr_location_code plc = { .str = "", }; /* obtain all VPD */ > >> int devfd = open("/dev/papr-vpd", O_WRONLY); > >> int vpdfd = ioctl(devfd, PAPR_VPD_CREATE_HANDLE, &plc); > >> size_t size = lseek(vpdfd, 0, SEEK_END); > >> char *buf = malloc(size); > >> pread(devfd, buf, size, 0); > >> > >> When a file descriptor is obtained from ioctl(PAPR_VPD_CREATE_HANDLE), > >> the file contains the result of a complete ibm,get-vpd sequence. The > > > > Could this be somewhat less obfuscated? > > > > What the caller wants is the result of "ibm,get-vpd", which is a > > well-known string identifier of the rtas call. > > Not really. What the caller wants is *the VPD*. Currently that's done > by calling the RTAS "ibm,get-vpd" function, but that could change in > future. There's RTAS calls that have been replaced with a "version 2" in > the past, that could happen here too. Or the RTAS call could be replaced > by a hypercall (though unlikely). > > But hopefully if the underlying mechanism changed the kernel would be > able to hide that detail behind this new API, and users would not need > to change at all.
Still the kernel could use the name that is well-known today even if it uses different implementation internally in the future. > > > Yet this identifier is never passed in. Instead we have this new > > PAPR_VPD_CREATE_HANDLE. This is a completely new identifier, specific to > > this call only as is the /dev/papr-vpd device name, another new > > identifier. > > > > Maybe the interface could provide a way to specify the service name? > > > >> file contents are immutable from the POV of user space. To get a new > >> view of VPD, clients must create a new handle. > > > > Which is basically the same as creating a file descriptor with open(). > > Sort of. But much cleaner becuase you don't need to create a file in the > filesystem and tell userspace how to find it. > > This pattern of creating file descriptors from existing file descriptors > to model a hiearachy of objects is well established in eg. the KVM and > DRM APIs. > > >> The memory required for the VPD buffers seems acceptable, around 20KB > >> for all VPD on one of my systems. And the value of the > >> /rtas/ibm,vpd-size DT property (the estimated maximum size of VPD) is > >> consistently 300KB across various systems I've checked. > >> > >> I've implemented support for this new ABI in the rtas_get_vpd() > >> function in librtas, which the vpdupdate command currently uses to > >> populate its VPD database. I've verified that an unmodified vpdupdate > >> binary generates an identical database when using a librtas.so that > >> prefers the new ABI. > >> > >> Likely remaining work: > >> > >> * Handle RTAS call status -4 (VPD changed) during ibm,get-vpd call > >> sequence. > >> * Prevent ibm,get-vpd calls via rtas(2) from disrupting ibm,get-vpd > >> call sequences in this driver. > >> * (Maybe) implement a poll method for delivering notifications of > >> potential changes to VPD, e.g. after a partition migration. > > > > That sounds like something for netlink. If that is desired maybe it > > should be used in the first place? > > I don't see why that is related to netlink. It's entirely normal for > file descriptor based APIs to implement poll. > > netlink adds a lot of complexity for zero gain IMO. It kind of solves the problem with shoehorning something that's not really a file into file descriptors. You don't have to when not using them. It also solves how to access multiple services without creating a large number of files and large number of obscure constants. Thanks Michal