[Qemu-devel] RFC: Let NBD client request read-only mode

Eric Blake Wed, 29 Nov 2017 07:00:49 -0800

Right now, only the server can choose whether an export is read-only. Aclient can always treat an export as read-only by not sending anywrites, but a server has no guarantee that a client will behave thatway, and must assume that an export where the server did not advertiseNBD_FLAG_READ_ONLY will modify the export. Therefore, if the serverdoes not want to permit simultaneous modifications to the underlyingdata, it has the choice of either permitting only one client at a time,or supporting multiple connections but enforcing all subsequentconnections to see the NBD_FLAG_READ_ONLY bit on the export that isalready in use by the first connection (note that this is racy - whoeverconnects first is the only one that can get write permissions, even ifthe first connected client doesn't want to write).

However, at least qemu has a case where it would be nice to permit aparallel known-read-only client from the same server that is (or willbe) handling a read-write client; and what's more, to make it so thatthe read-only client can win the race of being the first connectionwithout penalizing the actual read-write connection (seehttps://bugzilla.redhat.com/show_bug.cgi?id=1518543). I don't see anyway to accomplish this with oldstyle negotiation (but that doesn'tmatter these days); but with newstyle negotiation, there are at leasttwo possible implementations:

Idea 1: the server advertises a new global bit NBD_FLAG_NO_WRITE (ideasfor a better name?) in its 16-bit handshake flags; if the client replieswith the same bit set (documentation-wise, we'd name the client replyNBD_FLAG_C_NO_WRITE), then the server knows that the client promises tobe a read-only connection.

Idea 2: we add a new option, NBD_OPT_READ_ONLY. If the client sendsthis option, and the server replies with NBD_REP_ACK, then the serverknows that the client promises to be a read-only connection.

With either idea, once the server knows the client's intent to be aread-only client, the server SHOULD set NBD_FLAG_READ_ONLY on all(further) information sent for any export (whether fromNBD_OPT_EXPORT_NAME, NBD_OPT_INFO, or NBD_OPT_GO) and treat any exportas read-only for the current client, even if that export is in paralleluse by another read-write client, and the client MUST NOT sendNBD_CMD_WRITE, NBD_CMD_TRIM, NBD_CMD_WRITE_ZEROES, or any other commandthat requires a writable connection (the NBD_CMD_RESIZE extension comesto mind).

A client that wants to be read-only, but which does not see serversupport (in idea 1, the server did not advertise the bit; in idea 2, theserver replies with NBD_REP_ERR_UNSUP), does not have to do anythingspecial (it is always possible to do just reads to a read-writeconnection, and the server may still set NBD_FLAG_READ_ONLY even withoutsupporting the extension of permitting a client-side request). But sucha client may, if it wants to be nice to potential parallel writers onthe same export, decide to disconnect quickly (with NBD_OPT_ABORT orNBD_CMD_DISC as appropriate) rather than tie up a read-write connection.

I don't know which idea is more palatable. We have a finite set of only2^4 global handshake flags because it is a bitmask, where only 14 bitsremain; whereas we have almost 2^32 potential NBD_OPT_ values. On theother hand, using a global handshake flag means the server never showsany export as writable; while with the NBD_OPT_ solution, a guest canget different results for the sequence NBD_OPT_INFO, NBD_OPT_READ_ONLY,NBD_OPT_INFO. There's also the question with option 2 of whetherpermitting NBD_OPT_READ_ONLY prior to NBD_OPT_STARTTLS would make sense(is there any case where the set of TLS authentication to be performedcan involve looser requirements for a known-read-only client?), whereusing a global bit makes the sequence of required NBD_OPT_* a bit lessstateful.

Does the idea sound reasonable enough to propose wording to add it tothe NBD spec and an implementation in qemu? Which of the two ideas ispreferred for letting the client inform the server of its intent?


--
Eric Blake, Principal Software Engineer
Red Hat, Inc.           +1-919-301-3266
Virtualization:  qemu.org | libvirt.org

[Qemu-devel] RFC: Let NBD client request read-only mode

Reply via email to