Hi Thanks for the PRs ! I reviewed Anton's document, I will do a pass on the PRs.
Imho, it's important to get feedback from query engines, as, if delete vectors is not a problem per se (it's what we are using as internal representation), the use of Puffin files to store it is "impactful" for the query engines (probably some query engines might need to implement Puffin spec (read/write) using other language than Java, for instance Apache Impala). I like the proposal, I just hope we won't "surprise" some query engines with extra work :) Regards JB On Thu, Oct 10, 2024 at 11:41 PM rdb...@gmail.com <rdb...@gmail.com> wrote: > > Hi everyone, > > There seems to be broad agreement around Anton's proposal to use deletion > vectors in Iceberg v3, so I've opened two PRs that update the spec with the > proposed changes. The first, PR #11238, adds a new Puffin blob type, > delete-vector-v1, that stores a delete vector. The second, PR #11240, updates > the Iceberg table spec. > > Please take a look and comment! > > Ryan