Auditing via logical decoding

Philip Scott Fri, 27 Jul 2018 03:41:44 -0700

Hi Postgres Hackers,

We have been using our own trigger-based audit system at my firmsuccessfully for some years, but the performance penalty is starting tograte a bit and so I have been tasked with seeing if we can make use ofthe new logical decoding functions to achieve the same thing. I thoughtthat someone must already have written something that would satisfy ouruse-case but my internet searches have come up short so far so I amconsidering writing a logical decoding plugin to do what we want.

I thought I would run the idea past you all here just in case my plan iscrazy; I’ve browsed around the postgres source code a bit before butI’ve never really gotten my hands dirty and am a little bit nervousabout putting my own C code into the heart of our DBMS so if this comesto anything I would like to offer my code up for review and/or possibleinclusion as a contributed module.


A quick summary of requirements:

We want to log (to a separate, remote database)

- One row for every transaction that changes the state of thedatabase.We call this table ‘audit_entry’ and contains the xid, transactiontimestamp, username, client hostname, and application name of thesession that caused the change.- One row for each change made by each transaction which records thestate of the tuple before the change.We call this table ‘audit_detail’ and contains xid, statementtimestamp, table name & schema, event_type, primary_key (hstore),old_row (hstore), and the text of the query that was responsible for thechange.

A lot of that information is available already by listening to thepgoutput decoding, and my first thought was that I could just write areceiver for that. However, application name, username, client hostnameand current_query() are not available. This is understandable as theyaren’t useful for logical replication.


I was about to give up, when I discovered pg_logical_emit_message.

My current thoughts are to:

- Write this extra data into a logical message while the transactionis still in progess

    Either with a deferred trigger per table or, perhaps better
        Find some global commit-time (or xid-assigment time) hook emit it there

  - Then get the information out of the database:

Either modify the existing pgoutput plugin & protocol to forwardsuch messages in its stream,

    Or write a dedicated ‘audit’ decoding plugin with its own protocol

  - Then get the information into the ‘auditing’ database:

Either with some standalone process that connects to both, consumesthe output created above, translates it to SQL to run in the auditingDB.Figure out how to create a proper postgres background process to doit, in a similar fashion to the logical replication worker


Any input you folks have would be very much appreciated.

Kinds Regards,

Philip

PS: If there is someone out there who is willing & able to build thisfor less than my company will have to pay me to do it, please drop me aline ☺

Auditing via logical decoding

Reply via email to