Alexey Serbin created KUDU-3453:
-----------------------------------

             Summary: Fine-grained anchoring for WAL segments during tablet 
copying
                 Key: KUDU-3453
                 URL: https://issues.apache.org/jira/browse/KUDU-3453
             Project: Kudu
          Issue Type: Improvement
          Components: tablet, tserver
            Reporter: Alexey Serbin


Tablet copying is a provision to implement the process of automatic tablet 
re-replication in Kudu.  When the system catalog (Kudu master) detects that a 
tablet replica is no longer available, it automatically re-replicates a tablet 
to a destination tablet server using another healthy tablet replica in the 
cluster as the source.

When copying a tablet from one tablet server to another, the source tablet 
copying session "anchors" WAL segments to be transfered to the destination 
server, so they are not GC-ed by the tablet maintenance operation when they are 
no longer needed locally, but the tablet copy session is still in progress.

The anchored WAL segments are releases all at once when the tablet copying 
session completes with success of failure.  However, there might be long 
running tablet copying sessions, and with high data ingest rate, the source 
tablet replica might accumulate huge amount of WAL data which isn't relevant at 
both the source and the destination server.

To prevent accumulation of WAL data for long-running tablet copying sessions, 
it's necessary to update the WAL anchors in a more granular manner, e.g. 
un-anchor a segment once it has been successfully copied and persisted by the 
client tablet copying session.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to