[PROPOSAL] Partially Loading Metadata - LoadTable V2

Haizhou Zhao Wed, 09 Oct 2024 17:05:50 -0700

Hello Dev List,


I want to bring this proposal to discussion:


https://docs.google.com/document/d/1eXnT0ZiFvdm_Zvk6fLGT_UxVWO-HsiqVywqu1Uk8s7E/edit#heading=h.uad1lm906wz4



It proposes a new LoadTable API (branded LoadTableV2 at the moment) on REST
spec that allows partially loading table metadata. The motivation is to
stabilize and optimize Spark write workloads, especially on Iceberg tables
with big metadata (e.g. due to huge list of snapshot/metadata log,
complicated schema, etc.). We want to leverage this proposal to reduce
operational and monetary cost of Iceberg & REST catalog usages, and achieve
higher commit frequencies (DDL & DML included) on top of Iceberg tables
through REST catalog.



Looking forward to hearing feedback and discussions.


Thank you,

Haizhou

[PROPOSAL] Partially Loading Metadata - LoadTable V2

Reply via email to