[ https://issues.apache.org/jira/browse/FLINK-27707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jane Chan updated FLINK-27707:
------------------------------
    Description:
As discussed offline, the implementation has been changed as follows.
* Don't perform a scan during the planning phase. Instead, a flag is attached to the partition spec to indicate an ordinary, manually triggered compaction.
* Introduce a new {{PrecommittingSinkWriter}} implementation to perform dedicated compaction tasks. This writer scans and selects partitions and buckets according to the current sub-task id, and then creates a per-bucket compact writer to submit the compaction. Since no data is shuffled between source and sink, all compaction is performed when {{SinkWriterOperator#endInput}} is invoked.

  was:
Perform the scan on the latest snapshot, pick the data files to compact, and serialize them to a JSON string.

> Implement ManagedTableFactory#onCompactTable
> --------------------------------------------
>
>                 Key: FLINK-27707
>                 URL: https://issues.apache.org/jira/browse/FLINK-27707
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Table Store
>    Affects Versions: table-store-0.2.0
>            Reporter: Jane Chan
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: table-store-0.2.0
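
The sub-task-based selection in the updated description could look roughly like the sketch below. It is not taken from the Table Store code base: the class names, the {{endInput}} signature, and the hash-modulo ownership rule are all assumptions made for illustration. The issue only says that each writer selects partitions and buckets according to its sub-task id and compacts when the input ends.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;

// Hypothetical sketch: none of these names come from the Table Store code base.
final class CompactionAssignmentSketch {

    /** A (partition, bucket) pair discovered by scanning the latest snapshot. */
    static final class PartitionBucket {
        final String partition;
        final int bucket;

        PartitionBucket(String partition, int bucket) {
            this.partition = partition;
            this.bucket = bucket;
        }
    }

    /** Assumed ownership rule: hash of the pair, modulo parallelism, equals the sub-task id. */
    static boolean ownedBy(PartitionBucket pb, int subtaskId, int parallelism) {
        int hash = Objects.hash(pb.partition, pb.bucket);
        return Math.floorMod(hash, parallelism) == subtaskId;
    }

    /** Stand-in for the dedicated writer: it receives no records and compacts at end of input. */
    static final class CompactWriterSketch {
        private final int subtaskId;
        private final int parallelism;
        private final List<String> submitted = new ArrayList<>();

        CompactWriterSketch(int subtaskId, int parallelism) {
            this.subtaskId = subtaskId;
            this.parallelism = parallelism;
        }

        /** Called once when the input ends; `scanned` stands in for the snapshot scan result. */
        List<String> endInput(List<PartitionBucket> scanned) {
            for (PartitionBucket pb : scanned) {
                if (ownedBy(pb, subtaskId, parallelism)) {
                    // A real writer would create a per-bucket compact writer here.
                    submitted.add(pb.partition + "/bucket-" + pb.bucket);
                }
            }
            return submitted;
        }
    }

    public static void main(String[] args) {
        List<PartitionBucket> scanned = new ArrayList<>();
        for (String partition : new String[] {"dt=2022-05-01", "dt=2022-05-02"}) {
            for (int bucket = 0; bucket < 4; bucket++) {
                scanned.add(new PartitionBucket(partition, bucket));
            }
        }
        int parallelism = 3;
        for (int subtask = 0; subtask < parallelism; subtask++) {
            CompactWriterSketch writer = new CompactWriterSketch(subtask, parallelism);
            System.out.println(subtask + " -> " + writer.endInput(scanned));
        }
    }
}
```

Under this assumed rule every (partition, bucket) pair is owned by exactly one sub-task, so the writers need no coordination and no shuffle, which fits the note that all compaction can run at end of input.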