Danny Chen created HUDI-9381:
--------------------------------

             Summary: Remove Ckp metadata of Flink writers
                 Key: HUDI-9381
                 URL: https://issues.apache.org/jira/browse/HUDI-9381
             Project: Apache Hudi
          Issue Type: Improvement
          Components: writer-core
            Reporter: Danny Chen
            Assignee: voon
             Fix For: 1.1.0


Since Flink 1.15, a new API was introduced to support responsive comminication 
between write tasks and the coordinator: 
https://issues.apache.org/jira/browse/FLINK-26077.

We can use this API to fetch the in-memory current instant from the coordinator 
and remove/deprecate the ckp metadata.

The current impl of task gateway -> coordinator communitation:
 * in WriteOperatorFactory#createStreamOperator, the OperatorEventGateway is 
set up for each write function;
 * in each write function's #snapshotState, the gateway sends the write 
metadata event to the coordinator;

While in FLINK-26077, the OperatorEventGateway API was not changed, instead it 
changed a more underneath API named TaskOperatorEventGateway, which can be 
fetched through:
{code:java}
// parameters is an instance of StreamOperatorParameters<Object> in the 
WriteOperatorFactory
parameters.getContainingTask().getEnvironment().getOperatorCoordinatorEventGateway();
{code}

And we can send responsive msg with the new API:

{code:java}
TaskOperatorEventGateway#sendRequestToCoordinator
{code}

So somehow we need to figure out a way to setup/register the 
TaskOperatorEventGateway into each write function, we need to wrap ip the 
OperatorID  together with it becasue the method 
TaskOperatorEventGateway#sendRequestToCoordinator needs it.




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to