Re: Iceberg 1.10.0 release update - July 1, 2025

2025-07-03 Thread Szehon Ho
Thanks Steven! On Jul 3, 2025, at 1:56 PM, Steven Wu wrote: > because it makes a bit of backward incompatible change by using the Spark behavior (for argument case sensitivity and type coercion support), we should do it in 1.10 release as we will have Spark 4.0 as part of it iiuc. Else 1.11 will b

Re: Iceberg 1.10.0 release update - July 1, 2025

2025-07-03 Thread Steven Wu
Szehon's backport PR has been merged. Another blocker (dangling DVs for rewrite) was also merged. Core, Spark: Propagate orphaned delete files when rewriting data files There are still 3 blockers left, as a new PR (also dangling DV related) is identi

Re: Iceberg 1.10.0 release update - July 1, 2025

2025-07-03 Thread Szehon Ho
Hi Steven, thanks. One more: what do we think about having https://github.com/apache/iceberg/pull/13106/ as part of 1.10 release? It's migrating Spark procedure to use Spark 4's new DSV2 Procedure framework. It's not a blocker, but I feel, because it makes a bit of backward incompatible change by u

Re: Iceberg 1.10.0 release update - July 1, 2025

2025-07-03 Thread Steven Wu
> because it makes a bit of backward incompatible change by using the Spark behavior (for argument case sensitivity and type coercion support), we should do it in 1.10 release as we will have Spark 4.0 as part of it iiuc. Else 1.11 will be backward incompatible for the Spark 4.0 jar. Szehon, thank

Re: Iceberg Interval Types Proposal

2025-07-03 Thread Laurent Goujon
Like Russell, I think the addition of new types which are widely used in analytics seems like a good thing. The document still has various open comments regarding the representation, and so I wonder if things have been settled or not. I'm also curious if this proposal will also be joined by proposals on Parqu

Re: cleanExpiredMetadata in RemoveSnapshots

2025-07-03 Thread Jean-Baptiste Onofré
Hi Gabor, I would consider cleanExpiredMetadata as a table maintenance procedure. So, I agree that it should be managed by a catalog (as part of catalog policies and TMS). I'm not against switching the cleanExpiredMetadata flag to true, and letting the query engine and the catalog deal with that. Rega

Re: [discuss] pyiceberg 0.10.0 release

2025-07-03 Thread Jayce Slesar
Hey Kevin! In addition to most of those commits since the 0.9.1 release I think some good ones for the 0.10.0 milestone would be the following: Fixing UUID Support BigQuery Metastore Catal

Re: Issues with testcontainers

2025-07-03 Thread Claude Warren, Jr
I opened ticket https://github.com/apache/iceberg/issues/13452 to report and track this issue. On Thu, Jul 3, 2025 at 4:39 AM Jean-Baptiste Onofré wrote: > Hi > > That explains the problem we discussed together in your environment :) > > If it doesn't impact the other (majority) environments, I'

Re: Iceberg Interval Types Proposal

2025-07-03 Thread yun zou
Hi Laurent, Thank you for raising the Parquet and Arrow compatibility topic. The discussion is currently ongoing in the Parquet community. You can follow the thread here: https://lists.apache.org/thread/n8jdft4mltdcf91v7t8qf1hz5cg8nbnz Best Regards, Yun On Thu, Jul 3, 2025 at 8:42 AM Laurent Gou

Re: [DISCUSS] Proposal for Relative Path Support In Table Spec

2025-07-03 Thread ally heev
Hi folks, Do you accept contributions from people other than authors on improvement proposals? I would really like to contribute on the implementation side. Also, is the spec definition finalized? I see a couple of unaddressed queries in the doc. On Wed, May 28, 2025 at 3:44 AM Talat Uyarer wrote