Re: Internal key management system

Fabien COELHO Thu, 18 Jun 2020 23:44:18 -0700


Hello Masahiko-san,

What I referred to "only one key" is KEK.


Ok, sorry, I misunderstood.

Yeah, it depends on KMS, meaning we need different extensions for
different KMS. A KMS might support an interface that creates key if not
exist during GET but some KMS might support CREATE and GET separately.


I disagree that it is necessary, but this is debatable. The KMS-side
interface code could take care of that, eg:

   if command is "get X"
     if (X does not exist in KMS)
       create a new key stored in KMS, return it;
     else
       return KMS-stored key;
   ...

So you can still have a "GET" only interface which adapts to the "final"
KMS. Basically, the glue code which implements the interface for the KMS
can include some logic to adapt to the KMS point of view.


Is the above code is for the extension side, right?

Such a code could be in the command with which pg communicates (eg throughits stdin/stdout, or whatever) to get keys.

pg talks to the command, the command can do anything, such as storing keysor communicating with an external service to retrieve them, anythingreally, that is the point.

I'm advocating defining the pg/command protocol, something along "GET xxx"as you wrote, and possibly provide a possible/reasonable commandimplementation, which would be part of the code you put in your patch,only it would be in the command instead of postgres.

For example, if users want to use a cloud KMS, say AWS KMS, to storeDEKs and KEK they need an extension that is loaded to postgres and cancommunicate with AWS KMS. I imagine that such extension needs to bewritten in C,

Why? I could write it in bash, probably. Ok, maybe not so good for suid,but in principle it could be anything. I'd probably write it in C, though.

the communication between the extension uses AWS KMS API, and thecommunication between postgres core and the extension uses textprotocol.

I'm not sure of the word "extension" above. For me the postgres side couldbe an extension as in "CREATE EXTENSION". The command itself could beprovided in the extension code, but would not be in the "CREATEEXTENSION", it would be something run independently.

When postgres core needs a DEK identified by KEY-A, it asksfor the extension to get the DEK by passing something like “GET KEY-A”message, and then the extension asks the existence of that key to AWKKMS, creates if not exist and returns it to the postgres core. Is myunderstanding right?

Yes. The command in the use-case you outline would just be anintermediary, but for another use-case it would do the storing. The pointof aiming at extensibility if that from pg point of view the externalcommands provide keys, but what these commands really do to do this can beanything.

When we have TDE feature in the future, we would also need to change
frontend tools such as pg_waldump and pg_rewind that read database
files so that they can read encrypted files. It means that these
frond-end tools also somehow need to communicate with the external
place to get DEKs in order to decrypt encrypted database files. In
your idea, what do you think about how we can support it?

Hmmm. My idea was that the natural interface would be to get thingsthrough postgres. For a debug tool such as pg_waldump, probably it needsto be adapted if it needs to decrypt data to operate.

Now I'm not sure I understood, because of the DEK are managed by postgresin your patch, waldump and other external commands would have no access tothe decrypted data anyway, so the issue would be the same?

With file-level encryption, obviously all commands which have to read andunderstand the files have to be adapted if they are to still work, whichis another argument to have some interface rather than integratedserver-side stuff, because these external commands would need to be ableto get keys and use them as well.


Or I misunderstood something.

I'd like an extensible design to have anything in core. As I said in an
other mail, if you want to handle a somehow restricted use case, just
provide an external extension and do nothing in core, please. Put in core
something that people with a slightly different use case or auditor can
build on as well. The current patch makes a dozen hard-coded decisions
which it should not, IMHO.


It might have confused you that I included key manager and encryption
SQL functions to the patches but this key manager has been designed
dedicated to only TDE.

Hmmm. This is NOT AT ALL what the patch does. The documentation in yourpatch talks about "column level encryption", which is an applicationthing. Now you seem to say that it does not matter and can be removedbecause the use case is elsewhere.

It might be better to remove both SQL interface
and SQL key we discussed from the patch set as they are actually not
necessary for TDE purposes.

The documentation part of the patch, at no point, talks about TDE(transparent data encryption), which is a file-level encryption as far asI understand it, i.e. whole files are encrypted.

I'm lost, because if you want to do that you cannot easily usepadding/HMAC and so because they would change block sizes, and probablyyou would use CRT instead of CBC to be able to decrypt data selectively.


So you certainly succeeded in confusing me deeply:-)

Aside from the security risk you mentioned, it was a natural designdecision for me that we have our key manager component in postgres corethat is responsible for managing encryption keys for our TDE.

The patch really needs a README to explain what it really does, and why,and how, and what is the thread model, what are the choices (there shouldbe as few as possible), how it can/could be extended.

I've looked at the whole patch, and I could not find the place where filesare actually encrypted/decrypted at a low level, that I would expect forfile encryption implementation.

To make the key manager and TDE simple as much as possible, we discussedthat we will have cluster-wide TDE and key manager that manages a fewencryption keys used by TDE (e.g. one key for table/index encryption andanother key for WAL encryption), as the first step.

Hmmm. Ok. So in fact all that is for TDE, *but* the patch does not do TDE,but provides a column-oriented SQL-level encryption, which is unrelated toyour actual objective, which is to do file-level encryption in the end.

However, for TDE, it may that you cannot do it with a pg extension becausefor the extension to work the database must work, which would require some"data" files not to be encrypted in the first place. That seems like agood argument to actually have something in core.


Probably for TDE you only want the configuration file not to be encrypted.

I'd still advocate to have the key management system possibly outside ofpg, and have pg interact with it to get keys when needed. Probably key idswould be the relative file names in that case. The approach ofexternalizing encryption/decryption would be totally impractical forperformance reasons, though.

I see value in Cary Huang suggestion on the thread to have dynamicallyloaded functions implement an interface. That would at least allow toremove some hardcoded choices such as what cypher is actually used, keysizes, and so on. One possible implementation would be to manage thingsmore or less internally as you do, another to fork an external command andtalk with it to do the same.

However, I do not share the actual interface briefly outlined: I do notthinkpg should have to care about key management functions such asrotation, generation or derivation, storage… the interest of pg should belimited to retrieving the keys it needs. That does not mean such functionsdo not have security value and should not be implemented, I'd say that itshould not be visible/hardcoded in the pg/kms interface, especially ifthis interface is expected to be generic.

As I see it, a pg/kms C-level loadable interface would provide thefollowing function:

// options would be supplied by a guc and allow to initialize the// interface with the relevant data, whatever the underlying// implementation needs.

error kms_init(char *options);

// associate opaque key identifier to a local id
error kms_key(local_id int, int key_id_len, byte *key_id);

or maybe something like:

// would return the local id attributed to the key
error/int kms_key(key_id_len, key_id);

// the actual functions should be clarified

// for TDE file-level, probably the encrypted length is the same as the// input, you cannot have padding, hmac or whatever added.

// for SQL app-level, the rules could be different
error kms_(en|de)crypt(local_id int, int mode, int len,
                       byte *in, byte *out);

// maybe
error kms_key_forget(local_id int);
error kms_destroy(…);

// maybe, to allow extensibility and genericity
// eg kms_command("rotate keys with new kek=123");
error kms_command(char *cmd);

I'm a little bit unsure that there should be only one KMS active ever,though: a file-level vs app-level encryption could have quite differentconstraints. Also, should the app-level encryption be able to access keys

loaded for file-level encryption?

--
Fabien.

Re: Internal key management system

Reply via email to