Re: Welcome Huaxin Gao as a committer!

2025-02-06 Thread Steve Zhang
Congratulations Huaxin, well deserved! Thanks, Steve Zhang > On Feb 6, 2025, at 8:16 AM, Xingyuan Lin wrote: > > Congrats Huaxin! > > On Thu, Feb 6, 2025 at 11:11 AM Denny Lee <mailto:denny.g@gmail.com>> wrote: >> Congratulations Huaxin!!! >> >

Re: [DISCUSS/VOTE] Add in ChangeLog Reserved Field IDs to Spec and Decrement Row Lineage Reserved IDs

2025-01-26 Thread Steve Zhang
+1 (non-binding) Thanks, Steve Zhang > On Jan 25, 2025, at 10:48 AM, huaxin gao wrote: > > +1 (non-binding)

Re: [Discuss][Vote] Spec Change - Add optional field added-rows to Snapshot for Row Lineage

2025-01-16 Thread Steve Zhang
Thank you Russell! +1 (non-binding) Thanks, Steve Zhang > On Jan 15, 2025, at 10:53 PM, huaxin gao wrote: > > +1 (non-binding)

Re: [VOTE] Document Snapshot Summary Optional Fields as Appendix in Spec

2025-01-14 Thread Steve Zhang
+1 non-binding Thanks, Steve Zhang > On Jan 14, 2025, at 1:14 PM, Kevin Liu wrote: > > +1 non-binding.

Re: New committer: Scott Donnelly

2024-12-11 Thread Steve Zhang
Congratulations Scott! Thanks, Steve Zhang > On Dec 11, 2024, at 4:47 AM, Fokko Driesprong wrote: > > Congratulations Scott! >

Re: New committer: Matt Topol

2024-12-10 Thread Steve Zhang
Congrats Matt! Thanks, Steve Zhang > On Dec 10, 2024, at 7:24 AM, Gang Wu wrote: > > Congrats Matt!

Re: [Discuss] Simplify tableExists API in HiveCatalog

2024-12-02 Thread Steve Zhang
plans to change other existing behaviors. I've addressed the feedback from reviewers and also added explicit test coverage; the PR is ready for another look at https://github.com/apache/iceberg/pull/11597. Thanks, Steve Zhang > On Nov 27, 2024, at 10:15 PM, Péter Váry wrote: > > +

[Discuss] Simplify tableExists API in HiveCatalog

2024-11-21 Thread Steve Zhang
://github.com/apache/iceberg/pull/11597 [2]: https://github.com/apache/iceberg/blob/3badfe0c1fcf0c0adfc7aa4a10f0b50365c48cf9/open-api/rest-catalog-open-api.yaml#L1129-L1133 Best regards, Steve Zhang

Re: [DISCUSS] Deprecate embedded manifests

2024-11-21 Thread Steve Zhang
+1 to deprecate Thanks, Steve Zhang > On Nov 19, 2024, at 3:32 AM, Fokko Driesprong wrote: > > Hi everyone, > > I would like to propose to deprecate embedded manifests > <https://github.com/apache/iceberg/pull/11586>. This has been used before the > manifest

Re: [VOTE] Deprecate and remove last-column-id

2024-11-19 Thread Steve Zhang
+1 nb Thanks, Steve Zhang > On Nov 19, 2024, at 12:18 AM, Fokko Driesprong wrote: > > Hi everyone, > > Based on the positive feedback on the [DISCUSS] thread > <https://lists.apache.org/thread/jz5s7pm2bhbm87ft495d6yrsh3bqvtb9> and the > pull-request on GitHub

Re: [DISCUSS] Spark 3.3 support?

2024-11-13 Thread Steve Zhang
+1 to deprecating and removing it Thanks, Steve Zhang > On Nov 13, 2024, at 11:23 AM, Eduard Tudenhöfner > wrote: > > +1 to deprecating and removing it

Re: [VOTE] Deletion Vectors in V3

2024-10-31 Thread Steve Zhang
+1 (non-binding) Thanks, Steve Zhang > On Oct 31, 2024, at 3:41 PM, rdb...@gmail.com wrote: > > +1 > > Thanks, Anton! > > On Wed, Oct 30, 2024 at 11:58 PM Fokko Driesprong <mailto:fo...@apache.org>> wrote: >> +1 >> >> I had to read up a

Re: [DISCUSS] Remove iceberg-pig module ?

2024-10-17 Thread Steve Zhang
+1 Thanks, Steve Zhang > On Oct 17, 2024, at 11:16 PM, roryqi wrote: > > +1. > > Péter Váry <mailto:peter.vary.apa...@gmail.com> > wrote on Fri, Oct 18, 2024 at 13:44: >> +1 >> >> On Fri, Oct 18, 2024, 04:50 Manu Zhang > <mailto:owenzhang1...@gmail.com>

Re: [DISCUSS] Drop Hive 2 support

2024-09-09 Thread Steve Zhang
+1 Thanks, Steve Zhang > On Sep 9, 2024, at 11:45 AM, Russell Spitzer > wrote: > > +1 > > On Mon, Sep 9, 2024 at 7:59 AM Eduard Tudenhöfner <mailto:etudenhoef...@apache.org>> wrote: >> +1 on deprecating Hive 2 in Iceberg 1.7 and removing it in 1.8 >

Re: [VOTE] Merge REST Spec Change To Add New Scan Planning APIs

2024-09-06 Thread Steve Zhang
, Steve Zhang > On Sep 4, 2024, at 9:53 AM, Chertara, Rahil > wrote: > > An endpoint fetchScanTasks was added in order for a client to get the > file-scan-tasks associated with a plan-task by providing a plan-task as input. >

Re: [Discuss] test logging is broken and Avro 1.12.0 upgraded slf4j-api dep to 2.x

2024-08-26 Thread Steve Zhang
I believe dependabot tried to upgrade slf4j to 2.x in [1], but JB mentioned there's a -1 on this upgrade; maybe he has more context. [1] https://github.com/apache/iceberg/pull/9688 Thanks, Steve Zhang > On Aug 24, 2024, at 7:37 PM, Steven Wu wrote: > > Hi, > > It seems

Re: clarification on changelog behavior for equality deletes

2024-08-22 Thread Steve Zhang
cannot be used together. Thanks, Steve Zhang > On Aug 22, 2024, at 8:50 AM, Steven Wu wrote: > > > It should emit changes for each snapshot in the requested range. > > Wing Yew has a good point here. +1 > > On Thu, Aug 22, 2024 at 8:46 AM Wing Yew Poon > wrote:

Re: clarification on changelog behavior for equality deletes

2024-08-21 Thread Steve Zhang
I agree that option (a) is what users expect for row-level changes. I feel the added deletes in the given snapshots provide the PK of the DELETED entry, while existing deletes are read together with the data files to find the DELETED value (V1b) and result of columns. Thanks, Steve Zhang > On Aug

Re: [VOTE] Spec changes in preparation for v3

2024-08-19 Thread Steve Zhang
+1 (non-binding) Thanks, Steve Zhang > On Aug 19, 2024, at 1:47 PM, John Zhuge wrote: > > +1 (non-binding) > > On Mon, Aug 19, 2024 at 1:34 PM Yufei Gu <mailto:flyrain...@gmail.com>> wrote: >> +1 >> Yufei >> >> >> On Mon,

Re: Welcome Péter, Amogh and Eduard to the Apache Iceberg PMC

2024-08-13 Thread Steve Zhang
Congrats everyone, well deserved! Thanks, Steve Zhang > On Aug 13, 2024, at 1:31 PM, Bill Zhang wrote: > > Congratulations everyone. > > On Tue, Aug 13, 2024 at 1:28 PM Szehon Ho <mailto:szehon.apa...@gmail.com>> wrote: >> Congratulations all, very well des

Re: [VOTE] Clarify "File System Tables" in the table spec

2024-08-01 Thread Steve Zhang
+1 (non-binding) Thanks, Steve Zhang > On Aug 1, 2024, at 2:25 PM, John Zhuge wrote: > > +1 (non-binding) > > On Thu, Aug 1, 2024 at 10:55 AM Amogh Jahagirdar <2am...@gmail.com > <mailto:2am...@gmail.com>> wrote: >> +1 (non-binding) >> >

Re: [VOTE] Drop Java 8 support in Iceberg 1.7.0

2024-07-26 Thread Steve Zhang
+1 (non-binding) Thanks, Steve Zhang > On Jul 26, 2024, at 9:15 AM, Amogh Jahagirdar <2am...@gmail.com> wrote: > > +1 (non-binding)

Re: Java String to Expression Util?

2024-07-25 Thread Steve Zhang
/caf03aed926665014c22cc4a68902bf684f258f9/core/src/main/java/org/apache/iceberg/expressions/ExpressionParser.java#L262-L264 Thanks, Steve Zhang > On Jul 25, 2024, at 11:54 AM, Pucheng Yang wrote: > > Hi dev community, > > If I read the codebase correctly, there seems to be no utility for converting

Re: [ANNOUNCE] Welcoming new committers and PMC members

2024-07-23 Thread Steve Zhang
Congrats everyone! Thanks, Steve Zhang > On Jul 23, 2024, at 9:20 AM, Anton Okolnychyi wrote: > > Congrats everyone!

Re: Dropping JDK 8 support

2024-07-23 Thread Steve Zhang
+1 (non-binding) Thanks, Steve Zhang > On Jul 22, 2024, at 10:13 PM, Ajantha Bhat wrote: > > +1 (non-binding)

Re: [VOTE] Release Apache Iceberg 1.6.0 RC1

2024-07-22 Thread Steve Zhang
+1 non-binding Checked signature, SHA512 and license, built and ran tests against java 17 Thanks, Steve Zhang > On Jul 22, 2024, at 3:29 PM, Jack Ye wrote: > > +1 (binding) > > Checked signature, checksum, license > Ran unit and integration tests with JDK17 > Ran ma

Re: [DISCUSS] DROP PARTITION in Spark

2024-07-17 Thread Steve Zhang
-iceberg-do-differently Thanks, Steve Zhang > On Jul 17, 2024, at 2:36 PM, Walaa Eldin Moustafa > wrote: > > Hi Jean, One use case is Hive to Iceberg migration, where DROP PARTITION does > not need to change to DELETE queries prior to the migration. > > That said, I am not in

Re: [DISCUSS] Enable the discussion tab for iceberg github repos

2024-07-11 Thread Steve Zhang
+1, looking forward to seeing it in action. Would 3 months be a good evaluation window for the rest of the Iceberg repos? Thanks, Steve Zhang > On Jul 11, 2024, at 9:52 AM, Yufei Gu wrote: > > +1. It is a no-brainer to me given it is more search-engine friendly compared > to slack a

Re: [Vote] Deprecate oauth tokens endpoint

2024-07-10 Thread Steve Zhang
+1 (non binding) Thanks, Steve Zhang > On Jul 10, 2024, at 7:31 AM, Renjie Liu wrote: > > +1 (non binding)

Re: [VOTE] Fix property names in REST spec for statistics / partition statistics

2024-07-10 Thread Steve Zhang
+1 (non binding) Thanks, Steve Zhang > On Jul 10, 2024, at 1:10 AM, Jean-Baptiste Onofré wrote: > > +1 (non binding)

Re: Making the NDV property required for theta sketch blobs in Puffin

2024-06-22 Thread Steve Zhang
+1 for making the NDV property required in blob metadata Thanks, Steve Zhang > On Jun 21, 2024, at 2:54 PM, Amogh Jahagirdar <2am...@gmail.com> wrote: > > make the property required

Re: [Early Feedback] Variant and Subcolumnarization Support

2024-05-14 Thread Steve Zhang
Great ideas! Looking forward to the details of the community-focused proposal and how it can benefit Iceberg! Thanks, Steve Zhang > On May 10, 2024, at 10:06 PM, Tyler Akidau > wrote: > > Subcolumnarization of variant columns allows query engines to efficiently > prune datasets

Re: New committer: Renjie Liu

2024-03-11 Thread Steve Zhang
Congrats Renjie! Thanks, Steve Zhang > On Mar 11, 2024, at 12:18 PM, Szehon Ho wrote: > > Congratulations! > > On Mon, Mar 11, 2024 at 12:43 PM Jack Ye <mailto:yezhao...@gmail.com>> wrote: >> Congratulations Renjie! >> >> Best, >> Jack Y

Re: New committer: Bryan Keller

2024-03-05 Thread Steve Zhang
Congrats Bryan, well deserved! Thanks, Steve Zhang > On Mar 5, 2024, at 9:44 AM, Szehon Ho wrote: > > Congratulations Bryan, well deserved, great work on Iceberg ! > > On Tue, Mar 5, 2024 at 8:14 AM Jack Ye <mailto:yezhao...@gmail.com>> wrote: >> Congrats Bry

Re: [VOTE] Release Apache PyIceberg 0.6.0rc1

2024-01-31 Thread Steve Zhang
of 0.6.0! Thanks, Steve Zhang > On Jan 31, 2024, at 8:33 AM, Pucheng Yang wrote: > > nvm, I was under the wrong impression it was released already. Thanks. > > On Wed, Jan 31, 2024 at 8:31 AM Pucheng Yang <mailto:py...@pinterest.com>> wrote: >> 0.6.0 has been re

Re: [DISCUSS] PyIceberg 0.6.0 release

2024-01-26 Thread Steve Zhang
I am really excited to see that both append and overwrite support are finally checked, and I am in favor of seeing them in the 0.6.0 release. It’s a big milestone worth celebrating! I am happy to help with partitioned write and sort order. Thanks, Steve Zhang > On Jan 26, 2024, at 5:22 AM, Fo

Re: [DISCUSS] Iceberg community summit

2024-01-16 Thread Steve Zhang
I am also happy to contribute here! Thanks, Steve Zhang > On Jan 16, 2024, at 4:17 PM, Bill Zhang wrote: > > Same here. We'd like to volunteer and help out with this summit. > > On Mon, Jan 15, 2024 at 4:46 AM Eduard Tudenhoefner <mailto:edu...@tabular.io>> w

JDBC namespace existence check #8340

2023-10-19 Thread Steve Zhang
in https://github.com/apache/iceberg/issues/8321 and https://github.com/apache/iceberg/issues/8832. Thanks, Steve Zhang

Re: [VOTE] Release Apache Iceberg 1.4.1 RC0

2023-10-19 Thread Steve Zhang
+1 (non-binding) - validated checksum and signature - checked license docs & ran RAT checks - ran build and tests using JDK17 (hit problems with TestS3RestSigner and ADLSFileIOTest, but I think they are setup related) Thanks, Steve Zhang > On Oct 19, 2023, at 4:23 AM, Ajantha Bhat

Re: Welcome new committers and PMC!

2023-05-03 Thread Steve Zhang
Congrats everyone! Well deserved and great job! Thanks, Steve Zhang > On May 3, 2023, at 5:52 PM, Prashant Singh wrote: > > Congratulations, Amogh, Eduard, Szehon Well deserved ! > > On Wed, May 3, 2023 at 3:07 PM Steven Wu <mailto:stevenz...@gmail.com>>

Re: Support create table like for Iceberg table?

2023-04-25 Thread Steve Zhang
https://iceberg.apache.org/docs/latest/spark-ddl/#create-table Thanks, Steve Zhang > On Apr 25, 2023, at 1:46 PM, Pucheng Yang wrote: > > Hi all, > > I wonder how folks in the community deal with the cases where y

Re: [DISCUSS] Dropping Spark 2.4 support

2023-04-17 Thread Steve Zhang
+1 for dropping Spark 2.4 support; we can clean up the docs as well, such as https://iceberg.apache.org/docs/latest/spark-queries/#spark-24 Thanks, Steve Zhang > On Apr 13, 2023, at 12:53 PM, Jack Ye wrote: > > +1 for dropping 2.4 support >

Re: Welcome new PMC members!

2023-04-12 Thread Steve Zhang
Congratulations everyone! Thanks, Steve Zhang > On Apr 11, 2023, at 9:46 PM, Eduard Tudenhoefner wrote: > > Congrats to everyone! > > On Wed, Apr 12, 2023 at 6:14 AM Ajantha Bhat <mailto:ajanthab...@gmail.com>> wrote: > Congratulations to all. > > On Wed

Re: [DISCUSS] Removing python_legacy from the repo

2023-02-17 Thread Steve Zhang
Thank you Fokko and Ryan for your great work to reach feature parity. pyiceberg is the way to go! Thanks, Steve Zhang > On Feb 17, 2023, at 8:29 AM, Ryan Blue wrote: > > +1 for removing it. And it's great to see the new one reaching feature parity! > > On Fri, Feb

Re: [VOTE] Release Apache PyIceberg 0.2.0

2022-12-06 Thread Steve Zhang
But I cannot seem to figure out what’s wrong here. Thanks, Steve Zhang > On Dec 6, 2022, at 3:36 PM, Ryan Blue wrote: > > Russell, we normally test with `make test`, which runs everything but the S3 > mock stuff since that runs in CI. That said, it would be great if we could

Re: [VOTE] Release Apache PyIceberg 0.1.0 RC2

2022-09-30 Thread Steve Zhang
Thank you Fokko. I also forgot to update my vote to +1 now that the package version is clarified. Thank you for the great work! Steve Zhang > On Sep 30, 2022, at 8:02 AM, Driesprong, Fokko wrote: > > Hey Everyone, > > Thanks all for checking the release, and we can conclude the vot

Re: [VOTE] Release Apache PyIceberg 0.1.0 RC2

2022-09-25 Thread Steve Zhang
in local they are fine) Issues: - same version issue as Ryan pointed out Thanks, Steve Zhang > On Sep 25, 2022, at 10:37 AM, Ryan Blue wrote: > > +0 > > Looks great, except that the version isn’t correct: pyiceberg.__version__ > returns 0.1.0rc2 > > Passin

Re: Welcome Fokko Driesprong as a committer!

2022-08-22 Thread Steve Zhang
Congrats Fokko! Thanks, Steve Zhang > On Aug 22, 2022, at 2:05 PM, Szehon Ho wrote: > > Congratulations! > Szehon > > On Mon, Aug 22, 2022 at 12:25 PM Péter Váry <mailto:peter.vary.apa...@gmail.com>> wrote: > Congratulations Fokko! > > On Mon, A

Re: Data retention and expire the last snapshot

2022-07-01 Thread Steve Zhang
snapshot to be expired. I think it works for me. Appreciate your pointers Thanks, Steve Zhang > On Jun 29, 2022, at 4:32 PM, Russell Spitzer > wrote: > > Is "truncate" not an option? This would do a table wide delete which would > create a new snapshot which you can

Data retention and expire the last snapshot

2022-06-29 Thread Steve Zhang
iginal state right after table creation, before any data is written. Thanks, Steve Zhang

Re: Welcome Szehon Ho as a committer!

2022-03-11 Thread Steve Zhang
Congratulations Szehon, Well done! Thanks, Steve Zhang > On Mar 11, 2022, at 3:51 PM, Jack Ye wrote: > > Congratulations Szehon!! > > -Jack > > On Fri, Mar 11, 2022 at 3:45 PM Wing Yew Poon > wrote: > Congratulations Szehon! > > > On Fri, Mar 11,