Re: Transparent column encryption

Peter Eisentraut Thu, 30 Mar 2023 13:10:19 -0700

On 30.03.23 20:35, Stephen Frost wrote:

> I do feel that column encryption is a useful capability and there's
> large parts of this approach that I agree with, but I dislike the idea
> of having our clients be able to depend on what gets returned for
> non-encrypted columns while not being able to trust what encrypted
> column results are and then trying to say it's 'transparent'.  To that
> end, it seems like just saying they get back a bytea and making it clear
> that they have to provide the validation would be clear, while keeping
> much of the rest.

[Note that the word "transparent" has been removed from the feature name. I just didn't change the email thread name.]

These thoughts are reasonable, but I think there is a tradeoff to be made between having featureful data validation and enhanced security. If you want your database system to validate your data, you have to send it in plain text. If you want to have your database system not see the plain text, then it cannot validate it. But there is still utility in it.

You can't really depend on what gets returned even in the non-encrypted case, unless you cryptographically sign the schema against modification or something like that. So realistically, a client that cares strongly about the data it receives has to do some kind of client-side validation anyway.

Note also that higher-level client libraries like JDBC effectively do client-side data validation, for example when you call ResultSet.getInt() etc.

This is also one of the reasons why the user-facing type declaration exists. You could just make all encrypted columns of type "opaque" or something and not make any promises about what's inside. But client APIs sort of rely on the application being able to ask the result set for what's inside a column value. If we just say, we don't know, then applications (or driver APIs) will have to be changed to accommodate that, but the intention was to not require that. So instead we say, it's supposed to be int, and then if it's sometimes actually not int, then your application throws an exception you can deal with. This is arguably a better developer experience, even if it concerns the data type purist.
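
To make the "supposed to be int, otherwise you get an exception" behavior concrete, here is a minimal sketch in Python of what a driver-side accessor could look like. It is not taken from the patch or from any real driver: the names get_int and ColumnTypeMismatch are hypothetical, and it assumes the decrypted payload is the text representation of a 4-byte integer.

```python
import re

# Range of a 4-byte signed integer (PostgreSQL "integer"/int4).
INT4_MIN, INT4_MAX = -2**31, 2**31 - 1


class ColumnTypeMismatch(Exception):
    """Hypothetical error: decrypted bytes do not match the declared type."""


def get_int(decrypted: bytes) -> int:
    """Validate client-side that a decrypted column value is an int4.

    Assumption for this sketch: the plaintext is the ASCII text form of
    the integer. The server never saw the plaintext, so it could not have
    checked this; the check has to happen here.
    """
    try:
        text = decrypted.decode("ascii")
    except UnicodeDecodeError as exc:
        raise ColumnTypeMismatch("value is not ASCII text") from exc

    # Accept only an optional sign followed by digits.
    if not re.fullmatch(r"[+-]?[0-9]+", text):
        raise ColumnTypeMismatch(f"not an integer: {text!r}")

    value = int(text)
    if not INT4_MIN <= value <= INT4_MAX:
        raise ColumnTypeMismatch(f"out of range for int4: {value}")
    return value
```

An application would call get_int() on the decrypted bytes and handle ColumnTypeMismatch the same way it would handle a conversion error from ResultSet.getInt() on an unencrypted column.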


But do you have a different idea about how it should work?

> Expanding out from that I'd imagine, pie-in-the-sky
> and in some far off land, having our data type in/out validation
> functions moved to the common library and then adding client-side
> validation of the data going in/out of the encrypted columns would allow
> application developers to be able to trust what we're returning (as long
> as they're using libpq- and we'd have to document that independent
> implementations of the protocol have to provide this or just continue to
> return bytea's).

As mentioned, some client libraries effectively already do that. But even if we could make this much more comprehensive, I don't see how this can ever actually satisfy your point. It would require that all participating clients apply validation all the time, and all other clients can rely on that happening, which is impossible.
