Re: [Discuss] Geospatial Support

2024-12-06 Thread Szehon Ho
Hello all, In the last community sync, there seemed to be general agreement to split out the Geometry and Geography types, so we can get the spec proposal (https://github.com/apache/iceberg/pull/10981) with the Geometry type in time for Iceberg V3. Geometry type is still very useful to a lot of f

Re: [DISCUSS] Relocate Parquet to Iceberg Core

2024-12-06 Thread Daniel Weeks
Hey Ajantha, I understand it was discussed before, but I think a lot of recent discussions around improvements for parquet metadata/stats/etc are good justification for revisiting the earlier discussion. Parquet metadata has been brought up in relation to improving stats handling (allowing trackin

Re: [DISCUSS] Relocate Parquet to Iceberg Core

2024-12-06 Thread Ajantha Bhat
Hi Dan, I proposed the same last year while working on partition stats. I can revive this PR if required, https://github.com/apache/iceberg/pull/8500 But we decided that `iceberg-data` can write these parquet stats files (metadata) and core can just register it. So, it is no longer needed for p

[DISCUSS] Relocate Parquet to Iceberg Core

2024-12-06 Thread Daniel Weeks
Everyone, I wanted to propose moving the parquet implementation from the 'iceberg-parquet' project to the 'iceberg-core' project. The original motivation for keeping these subprojects separate was due to Iceberg relying on avro (which is included in the core project) for metadata and keeping othe

Re: [VOTE] Release Apache PyIceberg 0.8.1rc1

2024-12-06 Thread Kevin Liu
Hi Honah and Eduard, Following up on the test failures you both encountered, I opened apache/iceberg-python issue #1409 to track this. Thanks for taking the time to verify the release! Best, Kevin Liu On Fri, Dec 6, 2024 at 9:41 AM Kevin Li

[ANNOUNCE] Apache PyIceberg release 0.8.1

2024-12-06 Thread Kevin Liu
Hi everyone, I'm pleased to announce the release of Apache PyIceberg 0.8.1! Apache Iceberg is an open table format for huge analytic datasets. Iceberg delivers high query performance for tables with tens of petabytes of data, along with atomic commits, concurrent writes, and SQL-compatible table

Re: [VOTE] Release Apache PyIceberg 0.8.1rc1

2024-12-06 Thread Kevin Liu
Thanks everyone for voting! The 72 hours have passed, and a minimum of 3 binding votes have been cast: - +1: 3 (binding), 2 (non-binding) - +0: 0 - -1: 0 The release candidate has been officially accepted as *PyIceberg 0.8.1*. Thank you again for your participation! The announcement will be sent o

Re: [VOTE] Release Apache Iceberg 1.7.1 RC1

2024-12-06 Thread Russell Spitzer
I forgot to send this, left it in drafts :D +1 - Ran test suite for Polaris on https://github.com/apache/polaris/pull/442 - Ran all Iceberg Tests - Checked Signatures and Sums - Confirmed Jar is signed by a loving father and Iceberg PMC member On Thu, Dec 5, 2024 at 7:31 PM Bryan Keller wrote:

Re: [VOTE] Release Apache PyIceberg 0.8.1rc1

2024-12-06 Thread Eduard Tudenhöfner
+1 (binding) but I've also had the same tests fail that Honah mentioned earlier (which are due to deprecated dependencies) On Wed, Dec 4, 2024 at 3:52 AM Sung Yun wrote: > +1 (non-binding) > > Checked signatures, checksums and validated license headers. > > Ran the coverage tests using python3.1

Re: Storing catalog directly on object store

2024-12-06 Thread Steve Loughran
I am not expressing any opinion on the product whatsoever. What I will note is that I have spent 8 weeks full time this year dealing with AWS Java SDK problems in the more foundational parts of the SDK. https://github.com/steveloughran/engineering-proposals/blob/trunk/refactoring-s3a.md#aws-sdk-v