[
https://issues.apache.org/jira/browse/HUDI-6735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Harshal Patil updated HUDI-6735:
--------------------------------
Description:
Snapshot load scan of historical table ( having majority of data in archived
timeline ) causes large batch processing .
Adding interface to support breaking snapshotload query into batches which can
have commitId as checkpoint .
> Add support for SnapshotQueryLoadSplit interface
> ------------------------------------------------
>
> Key: HUDI-6735
> URL: https://issues.apache.org/jira/browse/HUDI-6735
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Harshal Patil
> Priority: Major
>
> Snapshot load scan of historical table ( having majority of data in archived
> timeline ) causes large batch processing .
> Adding interface to support breaking snapshotload query into batches which
> can have commitId as checkpoint .
--
This message was sent by Atlassian Jira
(v8.20.10#820010)