Re: [Proposal] REST Spec: Server-side Metadata Tables

Robert Stupp Thu, 04 Jul 2024 03:09:24 -0700

Hi Yufei,

I think the proposal is very interesting! The direction this and otherproposals are going is IMO the right one.

Since many proposals need access to at least manifest-lists and manifestfiles, potentially also data/delete files, does it make sense to bundleall proposals that need this ability?


Robert

On 03.07.24 22:44, Yufei Gu wrote:

Hi folks,

I'd like to discuss a new proposal to support server-side metadata tables.

One of Iceberg's most advantageous features is the ability to inspecta table using metadata tables. For instance, we can query snapshotsjust like we query data rows using the following command: SELECT *FROM prod.db.table.snapshots;

With the REST catalog, we can simplify this process further byproviding metadata directly from REST endpoints. Here are severalbenefits of this approach:


  * Engine Independence: The metadata tables do not rely on a specific
    implementation of an engine. The REST server returns the results
    directly. For example, the Rust Iceberg does not need to implement
    its own logic to query the snapshot table if it connects to a
    server with this capability. This reduces the complexity and
    development effort required for different clients and engines.
  * Enabled New Use Cases: A catalog UI or Lakehouse UI can present a
    table's metadata (e.g., snapshot/partition list) without relying
    on an engine like Trino. This opens up possibilities for
    lightweight UIs and tools that can directly interact with the REST
    endpoints to retrieve and display metadata.
  * Enhanced Performance: With server-side caching, the server-side
    metadata tables will perform better. Caching reduces the need to
    repeatedly compute or retrieve metadata, leading to faster
    response times and reduced load on the underlying storage systems.

Here is the proposal in google doc:https://docs.google.com/document/d/1MVLwyMQtZ-7jewsQ0PuTvtJbpfl4HCoVdbowMqFTmfc/edit?usp=sharing


Estimated read time: 5 mins

Would really appreciate any feedback on this topic and proposal!


Yufei


--
Robert Stupp
@snazy

Re: [Proposal] REST Spec: Server-side Metadata Tables

Reply via email to