Re: [ANNOUNCE] Welcome Prashant Singh as a new Apache Iceberg Committer

2025-07-22 Thread huaxin gao
Congratulations Prashant!! Best, Huaxin On Tue, Jul 22, 2025 at 12:42 PM Alexandre Dutra wrote: > Congratulations Prashant!! > > Thanks, > Alex > > Le mar. 22 juil. 2025 à 20:00, Naresh P R a écrit : > >> Congratulations Prashant ! >> --- >> Regards, >> Naresh P R >> >> On Tue, Jul 22, 2025 at

Re: [VOTE] Release Apache Iceberg 1.9.1 RC1

2025-05-27 Thread huaxin gao
+1 (non-binding) Verified signature, checksum, license and ran some tests. On Tue, May 27, 2025 at 9:06 AM Russell Spitzer wrote: > For all those who haven't seen this before, GPG key signing is a very > "early hacker" sort of thing. The idea is the only way to trust a signature > is to > have i

Re: [VOTE] [REST SPEC] Add row lineage fields.

2025-05-23 Thread huaxin gao
+1 (non-binding) On Fri, May 23, 2025 at 9:20 AM Yufei Gu wrote: > +1 (binding) > Yufei > > > On Fri, May 23, 2025 at 9:18 AM Jean-Baptiste Onofré > wrote: > >> +1 (non binding) >> >> Regards >> JB >> >> Le ven. 23 mai 2025 à 00:38, Prashant Singh a >> écrit : >> >>> Hi All, >>> I propose an u

Re: [VOTE] Adopt the v3 spec changes

2025-05-19 Thread huaxin gao
+1 (non-binding) On Mon, May 19, 2025 at 11:17 PM Eduard Tudenhöfner < etudenhoef...@apache.org> wrote: > +1 (binding) > > On Tue, May 20, 2025 at 8:14 AM Gidon Gershinsky wrote: > >> +1 (non-binding). >> Glad to see this big step forward. >> >> Cheers, Gidon >> >> >> On Tue, May 20, 2025 at 9:0

Re: [VOTE] Release Apache Iceberg 1.9.1 RC0

2025-05-18 Thread huaxin gao
+1 (non-binding) Verified signature, checksum and license. Thanks Russell for driving this release! Huaxin On Sun, May 18, 2025 at 2:03 PM Fokko Driesprong wrote: > +1 (binding) > > Checked signature, checksum, and licenses. > > Thanks Russell, for running this release! > > Kind regards, > Fokk

Spark 4.0/Iceberg Integration Merged – Spark 3.5 Merges Can Resume

2025-05-14 Thread huaxin gao
Dear all, Thank you so much for your patience and support! The Spark 4.0/Iceberg integration PR has now been merged. You can go ahead and resume normal merging on Spark 3.5. Really appreciate everyone’s help in coordinating this. Huaxin

Kind Request: Could We Please Hold Spark 3.5 Merges Briefly for Spark 4.0/Iceberg Integration?

2025-05-14 Thread huaxin gao
Dear community, Would it be possible to hold off on merging the Spark 3.5 changes for a few hours, until the Spark 4.0/Iceberg integration PR is in? It would help me avoid repeated rebasing. I really appreciate everyone’s support—thank you! Thanks, Huaxin

Re: [VOTE] Minor clarification for Geo Spec

2025-05-07 Thread huaxin gao
+1 (non-binding) On Wed, May 7, 2025 at 9:29 AM Denny Lee wrote: > +1 (non-binding) > > On Wed, May 7, 2025 at 8:37 AM Daniel Weeks wrote: > >> +1 (binding) >> >> On Wed, May 7, 2025 at 7:24 AM Russell Spitzer >> wrote: >> >>> +1 (bind) >>> >>> On Wed, May 7, 2025 at 7:32 AM Eduard Tudenhöfner

Re: [VOTE] Spec Update: Variant Field Lower/Upper Bounds

2025-04-18 Thread huaxin gao
+1 (non-binding) On Fri, Apr 18, 2025 at 12:20 PM Amogh Jahagirdar <2am...@gmail.com> wrote: > +1 (binding) > > On Fri, Apr 18, 2025 at 1:16 PM Prashant Singh > wrote: > >> +1 (non-binding) >> Best, >> Prashant Singh >> >> On Fri, Apr 18, 2025 at 11:58 AM Yufei Gu wrote: >> >>> +1(binding) >>>

Re: [VOTE] Simplify multi-argument field-id(s) encoding

2025-04-18 Thread huaxin gao
+1 (non-binding) On Fri, Apr 18, 2025 at 9:19 AM Steve Zhang wrote: > +1 (non-binding) > > Thanks, > Steve Zhang > > > > On Apr 18, 2025, at 12:53 AM, Prashant Singh > wrote: > > +1 (non-binding) > > >

Re: [VOTE] Update row lineage spec ID assignment

2025-04-17 Thread huaxin gao
+1 (non-binding) On Thu, Apr 17, 2025 at 4:22 PM Denny Lee wrote: > +1 (non-binding) > > On Thu, Apr 17, 2025 at 5:14 PM Aihua Xu wrote: > >> + (non-binding). >> >> On Thu, Apr 17, 2025 at 11:22 AM Steven Wu wrote: >> >>> +1 (binding) >>> >>> On Thu, Apr 17, 2025 at 11:09 AM Amogh Jahagirdar <

Re: [VOTE] Row lineage required for v3

2025-03-31 Thread huaxin gao
+1 (non-binding) On Mon, Mar 31, 2025 at 7:44 PM Renjie Liu wrote: > +1 > > On Tue, Apr 1, 2025 at 10:33 AM Denny Lee wrote: > >> +1 (non-binding) >> >> On Mon, Mar 31, 2025 at 7:27 PM roryqi wrote: >> >>> +1. >>> >>> Gang Wu 于2025年4月1日周二 09:30写道: >>> +1 (non-binding) On Tue, A

Re: [VOTE] Minor simplifications for Geo Spec

2025-03-22 Thread huaxin gao
+1 (non-binding) On Sat, Mar 22, 2025 at 6:32 PM Prashant Singh wrote: > +1 (non binding) > > Best, > Prashant > > On Fri, Mar 21, 2025 at 10:03 AM Russell Spitzer < > russell.spit...@gmail.com> wrote: > >> +1 (bind >> >> On Fri, Mar 21, 2025 at 11:53 AM Steve Zhang >> wrote: >> >>> +1 (non-bin

Re: [VOTE] Allow Row-Lineage with Equality Deletes

2025-02-20 Thread huaxin gao
+1 (non-binding) Thanks Russell! On Thu, Feb 20, 2025 at 1:57 AM Fokko Driesprong wrote: > +1 > > Thanks Russell! > > Op do 20 feb 2025 om 10:25 schreef Péter Váry >: > >> +1 >> >> Manu Zhang ezt írta (időpont: 2025. febr. 20., >> Cs, 8:06): >> >>> +1 (non-binding) >>> >>> Regards >>> Manu >>

Re: [VOTE] Add overwriteRequested to RegisterTableRequest in REST spec

2025-02-13 Thread huaxin gao
+1 (non-binding) On Thu, Feb 13, 2025 at 11:51 AM Anurag Mantripragada wrote: > +1 (non-binding) > > Thanks, Steve! > > ~ Anurag > > > > > > On Feb 13, 2025, at 10:34 AM, rdb...@gmail.com wrote: > > +1 > > On Thu, Feb 13, 2025 at 9:56 AM Huang-Hsiang Cheng > wrote: > >> +1 (non-binding) >> >> O

Re: [VOTE] Release Apache Iceberg 1.8.0 RC0

2025-02-12 Thread huaxin gao
+1 (non-binding) Verified signatures, checksums, build and ran some tests on my local. Thanks Amogh for driving the release! Best, Huaxin On Wed, Feb 12, 2025 at 11:28 AM Honah J. wrote: > +1 (binding) > > Verified signatures, checksum, build, and tests. > > Also noticed the same set of flaky

Re: Welcome Huaxin Gao as a committer!

2025-02-07 Thread huaxin gao
> > > >>>>>>>>>>>> > > On Thu, Feb 6, 2025 at 2:46 PM Tushar Choudhary < >>>>>>>>>>>> > > tushar.choudhary...@gmail.com> wrote: >>>>>>>>>>>> > > >>>>>>>>>>>> > >

Re: [VOTE] Add Geometry and Geography types for V3

2025-02-07 Thread huaxin gao
+1 (non-binding) On Fri, Feb 7, 2025 at 12:03 PM Honah J. wrote: > +1 > > Best regards, > Honah > > On Fri, Feb 7, 2025 at 10:45 AM Aihua Xu wrote: > >> +1 (non-binding). >> >> On Fri, Feb 7, 2025 at 8:12 AM Jean-Baptiste Onofré >> wrote: >> >>> +1 >>> >>> That's a great progress ! Thanks ! >>

Re: Welcome Huaxin Gao as a committer!

2025-02-07 Thread huaxin gao
>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Congratulations Huaxin. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> On Thu, Feb 6, 2025 at 6:45 AM Sung Yun >

Re: [VOTE] Update partition stats spec for V3

2025-02-01 Thread huaxin gao
+1 (non-binding) On Sat, Feb 1, 2025 at 8:50 AM Manish Malhotra < manish.malhotra.w...@gmail.com> wrote: > +1(nonbinding) > > On Sat, Feb 1, 2025 at 2:49 AM Russell Spitzer > wrote: > >> +1 >> >> On Sat, Feb 1, 2025 at 3:01 AM Anton Okolnychyi >> wrote: >> >>> Hi all, >>> >>> I propose the foll

Re: [VOTE] Release Apache Iceberg 1.7.2 rc0

2025-01-28 Thread huaxin gao
+1 (non-binding) Verified signature, checksums, license, built and ran some tests locally. On Tue, Jan 28, 2025 at 7:50 AM Eduard Tudenhöfner wrote: > +1 (verified sigs/checksums/license/ran tests locally) > > On Tue, Jan 28, 2025 at 7:00 AM Yuya Ebihara wrote: > >> +1 (non-binding) >> >> Chec

Re: [DISCUSS/VOTE] Add in ChangeLog Reserved Field IDs to Spec and Decrement Row Lineage Reserved IDs

2025-01-25 Thread huaxin gao
+1 (non-binding) On Sat, Jan 25, 2025 at 10:13 AM Péter Váry wrote: > +1 > Thanks for taking care of this! > > On Fri, Jan 24, 2025, 23:20 Yufei Gu wrote: > >> Thanks for fixing this, Russell! >> >> +1 for keeping the changelog view related id as is, given the changelog >> view has been widely

Re: [DISCUSS, VOTE] OpenAPI Metadata Update for EnableRowLineage

2025-01-23 Thread huaxin gao
+1 (non binding) Thanks Russell. On Thu, Jan 23, 2025 at 10:55 AM Fokko Driesprong wrote: > +1 > > Thanks Russell > > Op do 23 jan 2025 om 18:47 schreef Aihua Xu : > >> + (non binding). >> >> Thanks Russell. >> >> On Thu, Jan 23, 2025 at 2:05 AM Jean-Baptiste Onofré >> wrote: >> >>> +1 (non bi

Re: [VOTE] Document Snapshot Summary Optional Fields as Subsection of Appendix F in Spec

2025-01-21 Thread huaxin gao
+1 (non-binding) On Tue, Jan 21, 2025 at 6:04 PM Manu Zhang wrote: > +1 (non-binding) > > Thanks & Regards > > On Wed, Jan 22, 2025 at 8:06 AM Daniel Weeks wrote: > >> +1 (binding) >> >> On Tue, Jan 21, 2025 at 1:05 PM Szehon Ho >> wrote: >> >>> +1 (binding) >>> >>> Thanks >>> Szehon >>> >>> O

Re: [VOTE] Deprecate IRC snapshot-id Field of SetStatisticsUpdate

2025-01-21 Thread huaxin gao
+1 (non-binding) On Tue, Jan 21, 2025 at 4:20 PM Amogh Jahagirdar <2am...@gmail.com> wrote: > +1 Thank you Christian! > > On Tue, Jan 21, 2025 at 12:35 PM Sreeram Garlapati < > gsreeramku...@gmail.com> wrote: > >> +1 >> >> Thanks for cleaning this up. >> >> Best, >> Sreeram >> >> On Mon, Jan 20,

Re: [Discuss][Vote] Spec Change - Add optional field added-rows to Snapshot for Row Lineage

2025-01-15 Thread huaxin gao
+1 (non-binding) On Wed, Jan 15, 2025 at 10:51 PM Gang Wu wrote: > +1 (non-binding) > > On Thu, Jan 16, 2025 at 2:30 PM Péter Váry > wrote: > >> +1 >> >> Steven Wu ezt írta (időpont: 2025. jan. 16., Cs, >> 0:46): >> >>> +1 >>> >>> On Wed, Jan 15, 2025 at 9:00 AM Russell Spitzer < >>> russell.s

Re: [DISCUSS] Apache Iceberg (java) 1.8.0 release

2025-01-15 Thread huaxin gao
Can we also include the Comet and Iceberg integration? Here is the PR; most of the comments have been addressed, and I am currently working with Anton to finalize this. Thanks, Huaxin On Mon, Jan 13, 2025 at 11:55 AM Amogh Jahagirdar <2am...@gmail.c

Re: [VOTE] Document Snapshot Summary Optional Fields as Appendix in Spec

2025-01-14 Thread huaxin gao
+1 non-binding On Tue, Jan 14, 2025 at 1:21 PM Steve Zhang wrote: > +1 non-binding > > Thanks, > Steve Zhang > > > > On Jan 14, 2025, at 1:14 PM, Kevin Liu wrote: > > +1 non-binding. > > >

RE: Re: [DISCUSS] Apache Iceberg Summit 2025 - Selection Committee

2024-12-18 Thread huaxin gao
I would love to help as well. Apologies for not replying before the deadline. Hope I can still be of assistance. Thanks, Huaxin On 2024/12/10 08:20:00 Jean-Baptiste Onofré wrote: > Hi everyone, > > Thanks everyone for volunteering to help on the selection committee. > > We are Dec 10th, the call

[DISCUSS] Standardizing Error Handling in the Iceberg Spark Module

2024-12-18 Thread huaxin gao
Hi everyone, While working on integrating Spark 4.0 with Iceberg, I noticed that error conditions in the Spark module are primarily validated through the content of error messages. I need to revise some of the validation because the error messages have changed in Spark 4.0. Spark has standardized
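To illustrate the concern raised above, here is a minimal sketch of why checks that match on error-message text break when wording changes, while checks on a stable identifier do not. Everything in it is hypothetical and for illustration only: `EngineError`, `read_missing_table`, the condition name `TABLE_OR_VIEW_NOT_FOUND`, and both message strings are assumptions, not Spark or Iceberg APIs or actual Spark messages.

```python
class EngineError(Exception):
    """Hypothetical exception carrying a stable condition identifier."""

    def __init__(self, condition, message):
        super().__init__(message)
        self.condition = condition


def read_missing_table(reworded):
    """Always fails; `reworded` switches between two illustrative wordings."""
    message = (
        "The table or view `db`.`t` cannot be found."
        if reworded
        else "Table or view not found: db.t"
    )
    # The condition identifier is the same regardless of the wording.
    raise EngineError("TABLE_OR_VIEW_NOT_FOUND", message)


def fails_with_message(action, fragment):
    """Message-based check: passes only if the text contains `fragment`."""
    try:
        action()
    except EngineError as err:
        return fragment in str(err)
    return False


def fails_with_condition(action, condition):
    """Condition-based check: ignores the wording entirely."""
    try:
        action()
    except EngineError as err:
        return err.condition == condition
    return False


def old_wording():
    read_missing_table(reworded=False)


def new_wording():
    read_missing_table(reworded=True)
```

In this sketch, a test written as `fails_with_message(old_wording, "not found")` stops passing once the wording changes, whereas `fails_with_condition(..., "TABLE_OR_VIEW_NOT_FOUND")` holds for both versions.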

Re: New committer: Scott Donnelly

2024-12-11 Thread huaxin gao
Congratulations Scott! On Wed, Dec 11, 2024 at 9:07 AM Steve Zhang wrote: > Congratulations Scott! > > Thanks, > Steve Zhang > > > > On Dec 11, 2024, at 4:47 AM, Fokko Driesprong wrote: > > Congratulations Scott! > > >

Re: New committer: Matt Topol

2024-12-11 Thread huaxin gao
Congratulations, Matt! On Tue, Dec 10, 2024 at 11:13 PM Eduard Tudenhöfner < etudenhoef...@apache.org> wrote: > Congrats Matt! > > On Wed, Dec 11, 2024 at 6:41 AM Honah J. wrote: > >> Congratulations, Matt! >> >> On Tue, Dec 10, 2024 at 7:51 PM Fenil Jain wrote: >> >>> Congratulations Matt! >>>

Re: Welcome Péter, Amogh and Eduard to the Apache Iceberg PMC

2024-08-13 Thread huaxin gao
Congratulations, everyone! On Tue, Aug 13, 2024 at 1:53 PM Ryan Blue wrote: > Congratulations! Thanks for all your contributions! > > On Tue, Aug 13, 2024 at 1:48 PM Steve Zhang > wrote: > >> Congrats everyone, well deserved! >> >> Thanks, >> Steve Zhang >> >> >> >> On Aug 13, 2024, at 1:31 PM,

Re: [DISCUSS] Implementing a table-level statistics file to store column statistics

2024-08-06 Thread huaxin gao
ata present >>> for data files be possible? >>> To me, it seems like doing some amount of derivation at query time is >>> okay, as long as the time it takes to do the derivation doesn't increase >>> significantly as the table gets larger. >>> >

Re: [DISCUSS] Implementing a table-level statistics file to store column statistics

2024-08-02 Thread huaxin gao
in, max, and null > counts. > > Best, > Piotr > > > > On Fri, 2 Aug 2024 at 20:47, Samrose Ahmed wrote: > >> Isn't this addressed by the partition statistics feature, or do you want >> to have one row for the entire table? >> >> On Fri, Aug 2,

[DISCUSS] Implementing a table-level statistics file to store column statistics

2024-08-02 Thread huaxin gao
I would like to initiate a discussion on implementing a table-level statistics file to store column statistics, specifically min, max, and null counts. The original discussion can be found in this Slack thread: https://apache-iceberg.slack.com/archives/C03LG1D563F/p1676395480005779. In Spark 3.4,
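As a rough illustration of what table-level min, max, and null counts could look like, the sketch below folds per-file column summaries into a single table-level summary. It is an assumption-laden toy, not the proposed file format: `ColumnStats` and `merge_stats` are hypothetical names, values are plain integers, and row-level deletes are ignored.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class ColumnStats:
    """Hypothetical per-column summary; not an Iceberg class."""
    min_value: Optional[int]  # None when the file holds only nulls
    max_value: Optional[int]
    null_count: int


def merge_stats(per_file: Iterable[ColumnStats]) -> ColumnStats:
    """Fold per-file summaries into one table-level summary."""
    mins = []
    maxs = []
    nulls = 0
    for stats in per_file:
        if stats.min_value is not None:
            mins.append(stats.min_value)
        if stats.max_value is not None:
            maxs.append(stats.max_value)
        nulls += stats.null_count
    return ColumnStats(
        min(mins) if mins else None,
        max(maxs) if maxs else None,
        nulls,
    )


# Three illustrative data files; the last one contains only nulls.
files = [ColumnStats(3, 10, 1), ColumnStats(-2, 7, 0), ColumnStats(None, None, 5)]
table_level = merge_stats(files)
```

Storing the merged result once would let a reader fetch a single row per column instead of repeating this fold over every data file at query time.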

Re: Dropping JDK 8 support

2024-07-23 Thread huaxin gao
t in Iceberg 2.0 release". > It's fine for people to propose dropping JDK8 support sooner than that > (and I'm not against that), but the proposal being voted on should not be > switched mid-vote. > - Wing Yew > > > On Tue, Jul 23, 2024 at 10:45 PM huaxin gao > wr

Re: Dropping JDK 8 support

2024-07-23 Thread huaxin gao
in 1.6+ versions, which can be another > thread. > > On Wed, Jul 24, 2024 at 10:45 AM huaxin gao > wrote: > >> Hi Manu, >> Thanks for the discussion. Is your concern about customers who use JDK 8 >> with Spark 3.5? But we will face the same problem if we dr

Re: Dropping JDK 8 support

2024-07-23 Thread huaxin gao
Hi Manu, Thanks for the discussion. Is your concern about customers who use JDK 8 with Spark 3.5? But we will face the same problem if we drop JDK 8 in Iceberg 2.0, unless we plan to drop Spark 3.5 support in 2.0. Huaxin On Tue, Jul 23, 2024 at 7:30 PM Renjie Liu wrote: > Hi, Manu: > > > If we

Re: Dropping JDK 8 support

2024-07-23 Thread huaxin gao
harder because we're trying to get more things in a release. > Putting out a major release just for breaking API changes makes the most > sense to me. > > On Tue, Jul 23, 2024 at 9:50 AM Russell Spitzer > wrote: > >> +1 >> >> On Tue, Jul 23, 2024 at 11:4

Re: Dropping JDK 8 support

2024-07-23 Thread huaxin gao
ix-with-github-actions > > -Jack > > On Tue, Jul 23, 2024 at 9:15 AM huaxin gao wrote: > >> It seems my earlier question might have been overlooked. Could we clarify >> if JDK 8 support is being dropped in the next version? The proposal >> indicated for Iceberg 2

Re: Dropping JDK 8 support

2024-07-23 Thread huaxin gao
>>>> >>>>> On Tue, Jul 23, 2024 at 9:40 AM Szehon Ho >>>>> wrote: >>>>> >>>>>> +1 for dropping JDK 8 in Iceberg 2.0. I also wonder the same thing >>>>>> as Huaxin (sorry if I missed a previous thread on Iceb

Re: Dropping JDK 8 support

2024-07-22 Thread huaxin gao
+1 (non-binding) I have a question about Iceberg versioning. After the 1.6 release, will there be versions 1.7, 1.8 and 1.9, or will it go straight to 2.0? On Mon, Jul 22, 2024 at 5:32 PM Manu Zhang wrote: > If JDK 8 support is dropped in 2.0, will we continue to fix critical > issues in 1.6+?

Re: Building with JDK 21

2024-07-19 Thread huaxin gao
+1 in favor of adding Java 21 support +1 in favor of removing Java 8 support I am currently working on Spark 4.0 / Iceberg integration. Spark 4.0 runs on Java 17/21. On Fri, Jul 19, 2024 at 4:58 AM Piotr Findeisen wrote: > Hi, > > We recently start

Re: [VOTE] Fix property names in REST spec for statistics / partition statistics

2024-07-09 Thread huaxin gao
+1 On Tue, Jul 9, 2024 at 10:50 PM Driesprong, Fokko wrote: > +1 (binding) > > Op wo 10 jul 2024 om 07:47 schreef Renjie Liu > >> +1 (non binding) >> >> On Wed, Jul 10, 2024 at 1:45 PM Daniel Weeks wrote: >> >>> +1 (binding) >>> >>> On Tue, Jul 9, 2024, 8:35 PM Eduard Tudenhöfner < >>> etudenh

Re: Making the NDV property required for theta sketch blobs in Puffin

2024-06-21 Thread huaxin gao
+1 for making the ndv blob metadata property required for theta sketches. On Fri, Jun 21, 2024 at 2:54 PM Amogh Jahagirdar <2am...@gmail.com> wrote: > Hey all, > > I wanted to raise this thread to discuss a spec change proposal > for making the ndv b

Re: Dynamically Support Spark Native Engine in Iceberg

2024-02-18 Thread huaxin gao
this. >> >> Cell : 425-233-8271 >> >> >> On Tue, Feb 13, 2024 at 4:38 PM huaxin gao >> wrote: >> >>> Hello Iceberg community, >>> >>> As you may already know, Project Comet >>> <https://github.com/apache/arrow-datafusion-

Dynamically Support Spark Native Engine in Iceberg

2024-02-13 Thread huaxin gao
Hello Iceberg community, As you may already know, Project Comet, a plugin to accelerate Spark query execution by leveraging DataFusion and Arrow, has been open sourced under the Apache Arrow umbrella. To capitalize on the capabilities of Project

Re: In Remembrance of Kyle

2022-12-05 Thread huaxin gao
I am extremely shocked and saddened to hear of Kyle's passing. When I made my very first Iceberg PR last August, Kyle reviewed it immediately and helped me on it, and he did so for almost all my PRs. I pulled out a couple of my old PRs just now and re-read his comments. He liked to put smiling fa