Hi Gabor

Thanks for creating this discussion thread. This is indeed a good topic to
discuss.

The idea was to have lightweight compression for the footer for cass when
Puffin files are bigger.
It is true that the implementation didn't follow the spec yet.
If we remove this from the Puffin spec, we will probably want to add it
later.

The Iceberg implementation was supposed to be based on aircompressor pure
Java implementation https://github.com/airlift/aircompressor/pull/142.
AFAICT, aircompressor started to favor (or be more OK with) native
implementations (because of Project Panama), so adding LZ4 framed
compression might be simpler these days.

I would prefer to spend the effort on completing the compression.

Best
Piotr




On Tue, 27 Aug 2024 at 14:29, Gabor Kaszab <gaborkas...@cloudera.com.invalid>
wrote:

> Hi Iceberg Community,
>
> I saw in the Puffin spec <https://iceberg.apache.org/puffin-spec> that
> the footer of the Puffin file or the blobs themselves could be compressed
> by LZ4. I checked the code
> <https://github.com/apache/iceberg/blob/main/core/src/main/java/org/apache/iceberg/puffin/PuffinFormat.java#L110>
> however, and for me it seems that currently LZ4 is not supported.
> My first question is do I miss anything here?
> The second, is if we in fact don't support LZ4, can I remove it from the
> spec to avoid confusions? (I believe this requires a vote in a separate
> thread)
>
> Thanks,
> Gabor
>
>

Reply via email to