Andres, nice job on the writeup.
I think one aspect you are missing is that there must be some way for
the multi-masters to re-stabilize their data sets and quantify any data
loss. You cannot do this without some replication intelligence in each
row of each table, so that no matter how disastrous the hardware or
network failure in the cloud, the system can HEAL itself and keep going
with no human beings involved.
I am laying down a standard design pattern of columns for each row:

MKEY  - primary key guaranteed unique across ALL nodes in the CLOUD,
        with NODE information IN THE KEY (A876543 vs B876543, or
        whatever), so keys can be generated whether the network link
        is UP or DOWN
CSTP  - creation timestamp (Unix time)
USTP  - last-update timestamp (Unix time)
UNODE - node that last updated this record
Many applications already need the above information, so we might as
well standardize it so that external replication logic can self-heal;
a rough sketch of such a table follows below.
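
To make the pattern concrete, here is a minimal sketch of what that
column set could look like as PostgreSQL DDL. The table and the
non-replication columns are purely illustrative, and the one-letter
node prefix is just an assumption:

    -- Hypothetical example table carrying the proposed replication columns.
    CREATE TABLE customer (
        mkey    text PRIMARY KEY,   -- node-prefixed key, e.g. 'A876543'
        cstp    bigint NOT NULL,    -- creation timestamp, Unix epoch seconds
        ustp    bigint NOT NULL,    -- last-update timestamp, Unix epoch seconds
        unode   char(1) NOT NULL,   -- node that last updated this row
        name    text,
        balance numeric
    );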
PostgreSQL tables have optional 32-bit int OIDs; you may want to
consider a replication version of that, an ROID (replication object
ID), and then externalize the primary key generation into a loadable
UDF.
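
As a rough sketch only, the externalized key generator could start out
as something like the SQL function below, assuming each node is
configured with its own one-letter prefix; a production version would
more likely be a loadable C UDF:

    -- Per-node sequence feeding the node-prefixed keys.
    CREATE SEQUENCE mkey_seq;

    -- Prepend the node prefix to the next sequence value, e.g.
    -- next_mkey('A') returns 'A876543'.  Passing the prefix as an argument
    -- is an assumption; it could just as well come from a per-node setting.
    CREATE FUNCTION next_mkey(text) RETURNS text AS $$
        SELECT $1 || nextval('mkey_seq')::text;
    $$ LANGUAGE sql;

    -- Usage:
    --   INSERT INTO customer (mkey, cstp, ustp, unode, name)
    --   VALUES (next_mkey('A'), extract(epoch from now())::bigint,
    --           extract(epoch from now())::bigint, 'A', 'test');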
Of course, ALL the nodes must be in contact with each other and must
not allow significant drift between their clocks while operating (NTP
is a starting point).
I just do not know of any other way to add self-healing without the
above information, regardless of whether you hold up transactions for
synchronous replication or let them pass through asynchronously, and
regardless of whether you get your replication data from the WAL
stream or through the client libraries.
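
Just to illustrate the kind of self-heal those columns enable (not a
claim about how your patch would do it): with the remote copy of a
table visible locally, say as remote_customer, a last-update-wins
repair pass boils down to comparing USTP values:

    -- Pull in remote rows whose last update is newer than the local one.
    UPDATE customer AS l
    SET    name    = r.name,
           balance = r.balance,
           ustp    = r.ustp,
           unode   = r.unode
    FROM   remote_customer AS r
    WHERE  l.mkey = r.mkey
      AND  r.ustp > l.ustp;

    -- Rows present only on one side get handled in a separate insert pass
    -- keyed on mkey, and comparing the mkey sets quantifies anything lost.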
Also, your replication model does not really discuss replication
operations across a busted link; where is the intelligence for that in
the operation diagram?
Every time you package replication up into the core, someone has to
tear into that pile to add some extra functionality, so definitely
think about providing sensible hooks so that extra bit of
customization can override the base behavior.
Cheers,
marco
On 9/22/2012 11:00 AM, Andres Freund wrote:
This time I really attached both...