+1, really nice! Indexes are coming! On Fri, Jun 10, 2022 at 8:04 AM Szehon Ho <szehon.apa...@gmail.com> wrote:
> +1, it's an exciting step for Iceberg, look forward to all the new > statistics and secondary indices it will allow. > > Had a few questions of what the reference to Puffin file(s) will be in the > Iceberg spec, but it's orthogonal to Puffin file format itself. > > Thanks, > Szehon > > On Thu, Jun 9, 2022 at 3:32 PM Ryan Blue <b...@tabular.io> wrote: > >> +1 from me! >> >> There may also be people that haven't followed the design discussions and >> we can start a DISCUSS thread if needed. But if everyone is comfortable >> with the design and implementation, I think it's ready for a vote as well. >> >> Huge thanks to Piotr for getting this ready! I think the format is going >> to be really useful for both stats and indexes in Iceberg. >> >> On Thu, Jun 9, 2022 at 3:35 AM Piotr Findeisen <pi...@starburstdata.com> >> wrote: >> >>> Hi Everyone, >>> >>> I propose that we adopt Puffin file format as a file format for >>> statistics and indexes in Iceberg tables. >>> >>> Puffin file format specification: >>> https://github.com/apache/iceberg/blob/master/format/puffin-spec.md >>> (previous discussions: https://github.com/apache/iceberg/pull/4944, >>> https://github.com/apache/iceberg-docs/pull/69) >>> >>> Intend use: >>> * statistics in Iceberg tables (see >>> https://github.com/apache/iceberg/pull/4945 and associated proposed >>> implementation https://github.com/apache/iceberg/pull/4741) >>> * in the future: storage for secondary indexes >>> >>> Puffin file reader and writer implementation: >>> https://github.com/apache/iceberg/pull/4537 >>> >>> Thanks, >>> PF >>> >>> >> >> -- >> Ryan Blue >> Tabular >> > -- Best Regards