Danny Chen created HUDI-9381:
--------------------------------
Summary: Remove Ckp metadata of Flink writers
Key: HUDI-9381
URL: https://issues.apache.org/jira/browse/HUDI-9381
Project: Apache Hudi
Issue Type: Improvement
Components: writer-core
Reporter: Danny Chen
Assignee: voon
Fix For: 1.1.0
Since Flink 1.15, a new API was introduced to support responsive comminication
between write tasks and the coordinator:
https://issues.apache.org/jira/browse/FLINK-26077.
We can use this API to fetch the in-memory current instant from the coordinator
and remove/deprecate the ckp metadata.
The current impl of task gateway -> coordinator communitation:
* in WriteOperatorFactory#createStreamOperator, the OperatorEventGateway is
set up for each write function;
* in each write function's #snapshotState, the gateway sends the write
metadata event to the coordinator;
While in FLINK-26077, the OperatorEventGateway API was not changed, instead it
changed a more underneath API named TaskOperatorEventGateway, which can be
fetched through:
{code:java}
// parameters is an instance of StreamOperatorParameters<Object> in the
WriteOperatorFactory
parameters.getContainingTask().getEnvironment().getOperatorCoordinatorEventGateway();
{code}
And we can send responsive msg with the new API:
{code:java}
TaskOperatorEventGateway#sendRequestToCoordinator
{code}
So somehow we need to figure out a way to setup/register the
TaskOperatorEventGateway into each write function, we need to wrap ip the
OperatorID together with it becasue the method
TaskOperatorEventGateway#sendRequestToCoordinator needs it.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)