Re: Welcome Yufei Gu as a committer

2022-08-25 Thread Kyle Bendickson
hyi >>>> wrote: >>>> > >>>> > I’d like to welcome Yufei Gu as a committer to the project. >>>> > >>>> > Thanks for all your hard work, Yufei! >>>> > >>>> > - Anton >>>> >>> -- Kyle Bendickson OSS Developer | Tabular <https://tabular.io/> k...@tabular.io

Re: Welcome Fokko Driesprong as a committer!

2022-08-21 Thread Kyle Bendickson
I would like to welcome Fokko Driesprong as a new committer to the >> project! >> >> Thanks for all your contributions, Fokko! >> >> >> Ryan >> >> -- >> Ryan Blue >> Tabular >> > -- > > Sam Redai > > Developer Advoca

Proposal - Improving Github Issues Experience via Templates and Pruning Stale Issues

2022-07-12 Thread Kyle Bendickson
t;, with a comment to the user, after N days that the issue will be closed if there's no activity (currently 180 days). After an additional time period of X days (currently 14), if no activity occurs on the issue, it would be closed: https://github.com/apache/iceberg/pull/4949 <https://gith

Re: [ANNOUNCE] Apache Iceberg release 0.13.2

2022-06-15 Thread Kyle Bendickson
che.org/dyn/closer.cgi/iceberg/apache-iceberg-0.13.2/apache-iceberg-0.13.2.tar.gz >> >> Java artifacts are available from Maven Central. >> >> Thanks to everyone for contributing! >> > -- Kyle Bendickson OSS Developer | Tabular <https://tabular.io/> k...@tabular.io

Re: 【Feature】Request support for c++ sdk

2022-06-12 Thread Kyle Bendickson
this is a non-trivial effort to >> get >> > > underway. >> > > >> > > Another area to consider is compatibility testing. I think before a >> third >> > > officially supported community library is introduced it would be good >> to >> > &

Re: [VOTE] Adopt Puffin format as a file format for statistics and indexes

2022-06-12 Thread Kyle Bendickson
+1 [non-binding] Thank you Piotr for all of the work you’ve put into this. This should greatly benefit not only Iceberg on Trino, but hopefully can be used in many novel ways due to its well thought out generic design and incorporation of the ability to extend with new sketches. Looking forward

Re: 【Feature】Request support for c++ sdk

2022-06-05 Thread Kyle Bendickson
Looking forward to seeing you at the next community sync. A design document and/or any prior art would be very helpful as the community sync does discuss many topics (possibly there is existing C++ support in StarRocks for Iceberg V1?). Thank you, Kyle Bendickson GitHub: kbendick On Sun, Jun 5

Re: [VOTE] Release Apache Iceberg 0.13.2 RC1

2022-06-05 Thread Kyle Bendickson
, but the 0.13.x branch doesn't have the commit ID in question. Not sure if that's a major concern or not. Cheers, Kyle On Sun, Jun 5, 2022 at 11:51 AM Kyle Bendickson wrote: > Thanks Eduard! > > I have: > - verified the signature > - verified the checksum in the file gi

Re: [VOTE] Release Apache Iceberg 0.13.2 RC1

2022-06-05 Thread Kyle Bendickson
Thanks Eduard! I have: - verified the signature - verified the checksum in the file given as well as of the artifact - ran all unit tests on Java 11, all passed - ran all unit tests on Java 8, some hive-3 tests consistently fail (I do notice they passed on Github - but the tests which fail are con

Re: [discuss] keep the commit history when adding a new engine version

2022-05-13 Thread Kyle Bendickson
I agree this is a good point. The git history is not retained when we port the way we currently do. So +1 as I understand it, the latest version will generally be the one to have the most git commit history. Possibly looking back for changes that occurred due to some other version. Thanks Liwei!

Re: Welcome Szehon Ho as a committer!

2022-03-11 Thread Kyle Bendickson
Congratulations. Szehon! Well deserved! On Fri, Mar 11, 2022 at 4:06 PM Steven Wu wrote: > Congrat, Szehon! > > On Fri, Mar 11, 2022 at 4:05 PM Chao Sun wrote: > >> Congratulations Szehon! >> >> On Fri, Mar 11, 2022 at 4:01 PM OpenInx wrote: >> > >> > Congrats Szehon! >> > >> > On Sat, Mar 12

Re: [DISCUSS] The correct approach to estimate the byte size for an unclosed ORC writer.

2022-03-03 Thread Kyle Bendickson
Hi Openinx. Thanks for bringing this to our attention. And many thanks to hiliwei for their willingness to tackle big problems and little problems. I wanted to say that I think most anything that’s relatively close would be better than the current situation most likely (where the feature is disab

Re: [DISCUSS] Align the spark runtime artifact names among spark2.4, spark3.0, spark3.1 and spark3.2

2022-02-20 Thread Kyle Bendickson
Thanks for bringing this up Jeff! Normally I agree, it’s not a good practice to change artifact name. However, in this case, the artifact has changed already. The “spark3-runtime” used to be for all versions of Spark 3 (at the time Spark 3.0 and 3.1). It no longer is, as it’s only tested / used wi

Re: [VOTE] Release Apache Iceberg 0.13.1 RC0

2022-02-14 Thread Kyle Bendickson
+1 (non-binding) License checks, various smoke tests for create table, update, merge into, deletes, etc against Java 11 and Spark 3.2 and 3.1. - Kyle Bendickson On Mon, Feb 14, 2022 at 12:32 PM Ryan Blue wrote: > +1 (binding) > > * Ran license checks, verified checksum and signature

Re: New Versioned Iceberg Documentation Site

2022-02-01 Thread Kyle Bendickson
+1 from me. This looks great. Thank you for all your hard work, Sam! On Tue, Feb 1, 2022 at 10:33 AM Jack Ye wrote: > +1, amazing website! And now the website repo is separated we can continue > to iterate and deploy quickly without affecting the main repo, so no need > to be 100% perfect as of

Re: [VOTE] Release Apache Iceberg 0.13.0 RC2

2022-01-30 Thread Kyle Bendickson
+1 (non-binding) Verified signature, checksum, rat check, build and ran tests, and tested the relevant JAR on both Spark 3.1 and 3.2. - Kyle On Sun, Jan 30, 2022 at 12:45 AM Szehon Ho wrote: > +1 (non-binding) > > Verified signature > Verified checksum > Rat check > Built and ran test, all suc

Re: [VOTE] Release Apache Iceberg 0.13.0 RC1

2022-01-25 Thread Kyle Bendickson
Thank you, Jack! Quick announcement when testing: *the runtime jars / artifacts for Spark & Flink have changed naming format *to include the corresponding Spark / Flink version. The Spark jars also have the Scala version appended at the end. *Spark:* You can test the 0.13.0-rc1, fetching it from

Re: Iceberg engine version maintenance lifecycle

2022-01-08 Thread Kyle Bendickson
Thank you Jack for your thoughts. I'm very much in agreement with you. I'd like to discuss the beta version further. Ideally, to me, the beta version is the minimum change set to work as-is with that version of the system. We would ideally create a beta that ignores new features, optimizations,

Re: Time-sliced incremental scan

2022-01-08 Thread Kyle Bendickson
Thank you Ryan for summarizing that so well. I'm in agreement that it's too convenient to simply ignore due to those caveats, though they are admittedly potentially large caveats. However, some people don't interact with their table that way and I see discussion around ways to implement increment

Re: REST catalog proposal

2021-12-14 Thread Kyle Bendickson
Hi Ryan, Sorry for the late response. I feel Jack and Ryan have summed up things very well. I will also answer the questions from my perspective, as you did ask and I do a few thoughts outside of what was shared. For starters, this is an additional catalog. The other catalogs, as well as the ab

Re: Single multi-process commit

2021-12-03 Thread Kyle Bendickson
This could also be achieved using the Write-Audit-Publish feature I believe, where you audit a set of writes and then choose to publish them. Though I'm not as familiar with that feature, but you might look into that as well. Thanks, Kyle Bendickson

Re: Iceberg event notification support

2021-11-30 Thread Kyle Bendickson
I think this is a great idea, Jack. Thank you for bringing this up! +1 There have been several people interested in having more observability (for example for table design patterns akin to how folks might monitor Hive) and events would be a big win for that and something users could use with a lot

Re: Proposal: Switch docs site from mkdocs to hugo and relocate to a separate iceberg-docs repo

2021-11-29 Thread Kyle Bendickson
Wow, the prototype looks great, Sam! I'd like to add a little bit about possible avenues for hosting to explore and other corner areas. I only have one thing to add: 1) For the latest docs, can we consider including a warning message on the page that this is for the master version. Apache Flink

Re: Welcome new PMC members!

2021-11-17 Thread Kyle Bendickson
Congratulations to both Jack and Russell! Very we deserved indeed :) On Wed, Nov 17, 2021 at 4:12 PM Ryan Blue wrote: > Hi everyone, I want to welcome Jack Ye and Russell Spitzer to the Iceberg > PMC. They've both been amazing at reviewing and helping people in the > community and the PMC has d

[ANNOUNCE] Apache Iceberg release 0.12.1

2021-11-10 Thread Kyle Bendickson
ution. This release can be downloaded from https://www.apache.org/dyn/closer.cgi/iceberg/apache-iceberg-0.12.1/apache-iceberg-0.12.1.tar.gz Java artifacts are available from Maven Central. Thanks to everyone for contributing! -- Best, Kyle Bendickson Github: @kbendick

[RESULT] [VOTE] Release Apache Iceberg 0.12.1

2021-11-08 Thread Kyle Bendickson
OM `hadoop_prod`.`default`.`flink_table`; >>> ++--+ >>> | id | data | >>> ++--+ >>> | 1 | AAA | >>> ++--+ >>> 1 row in set >>> >>> Thanks all for the work. >>> >>> On Fri, Nov 5, 2021 at 2

Re: [DISCUSS] Iceberg roadmap

2021-11-07 Thread Kyle Bendickson
the community valued, could a connector be built to target the Apache Doris project and then StarRocks could fork that code if they liked? - Kyle Bendickson GitHub @kbendick On Sun, Nov 7, 2021 at 9:24 PM Reo Lei wrote: > +1, I have the same concern for the incompatible license. > >

Re: Standard practices around PRs against multiple Spark versions

2021-11-06 Thread Kyle Bendickson
or >> other projects, like Parquet, have so much content that isn't useful that I >> think they're encouraging people with small commits to abandon the effort. >> >> Ryan >> >> On Fri, Nov 5, 2021 at 1:06 PM Kyle Bendickson wrote: >> >>>

Re: Standard practices around PRs against multiple Spark versions

2021-11-05 Thread Kyle Bendickson
guidelines related to engine versions, as well as >>> Iceberg's preferred code style, naming convention, title convention, etc. >>> to make new contributors a bit easier to submit changes without too much >>> rewrite. Currently I observe that every time there is a

Re: [VOTE] Release Apache Iceberg 0.12.1 RC0

2021-11-04 Thread Kyle Bendickson
defaultCatalog=local >> >> >> On Thu, Nov 4, 2021 at 1:02 PM Jack Ye wrote: >> >>> +1, non-binding >>> >>> ran checksum, build, unit tests, AWS integration tests and verified >>> fixes in EMR 6.4.0. >>> >>> Best, >>

Standard practices around PRs against multiple Spark versions

2021-11-03 Thread Kyle Bendickson
review overhead). Do we have a standard / preference on that? Jack mentioned he wasn't certain, so I thought I'd ask here. Seems like a good practice but hoping to get some clarification :) -- Best, Kyle Bendickson Github: @kbendick

[VOTE] Release Apache Iceberg 0.12.1 RC0

2021-11-02 Thread Kyle Bendickson
Please download, verify, and test. Please vote in the next 72 hours. [ ] +1 Release this as Apache Iceberg [ ] +0 [ ] -1 Do not release this because... -- Best, Kyle Bendickson Github: @kbendick

Re: Iceberg 0.12.1 Patch Release - Call for Bug Fixes and Patches

2021-10-27 Thread Kyle Bendickson
d. And if we don't think that a patch > release for this is needed, then I think that makes it less important to > get it into 0.12.1. > > What does everyone else think? Should we wait for this Hive fix? > > On Wed, Oct 27, 2021 at 3:17 AM OpenInx wrote: > >> I think w

Re: Help improve Iceberg community meeting experience

2021-10-23 Thread Kyle Bendickson
+1 for the suggestion Jack. The time limit has definitely been a point of pain at times. And also, if somebody takes a week or two off of work it can be really easy to miss things. +1 for volunteering to help make this happen Sam! Please let me know if I can help in any way! I wonder if we can al

Re: Iceberg 0.12.1 Patch Release - Call for Bug Fixes and Patches

2021-10-21 Thread Kyle Bendickson
that will be mire up to the community). If any committers or contributors have free cycles and are willing to review some of these PRs, that would be greatly appreciated! - Kyle Bendickson [@kbendick] On Thu, Oct 21, 2021 at 11:19 AM Peter Vary wrote: > Just to make this clean https://github.

Iceberg 0.12.1 Patch Release - Call for Bug Fixes and Patches

2021-10-20 Thread Kyle Bendickson
current list of patches to be included can be found in the milestone on Github: https://github.com/apache/iceberg/milestone/15?closed=1 As new items are added, they will be included in the milestone. Best, Kyle Bendickson [ Github: @kbendick ]

Re: Snapshot tagging, branching and retention

2021-10-18 Thread Kyle Bendickson
Thanks for collecting these notes as well as for the proposal, Jack. Have been traveling today so I couldn't attend. Will be looking out for the new PR. Best, Kyle Bendickson (@kbendick) On Mon, Oct 18, 2021 at 9:58 AM Jack Ye wrote: > Thanks to everyone who came to the meeting

Re: [DISCUSS] Iceberg roadmap

2021-09-17 Thread Kyle Bendickson
each other and can both progress independently. But I would defer to Steven or anybody else who works with Flink much more often than I do currently. - Kyle Bendickson > On Sep 15, 2021, at 4:06 PM, Ryan Blue wrote: > > That sounds great, thanks for taking that on Jack! > > On We

Re: [VOTE] Release Apache Iceberg 0.12.0 RC2

2021-08-06 Thread Kyle Bendickson
+1 (binding) I verified: - KEYS signature & checksum - ./gradlew clean build (tests, etc) - Ran Spark jobs on Kubernetes after building from the tarball at https://dist.apache.org/repos/dist/dev/iceberg/apache-iceberg-0.12.0-rc2/ - Spark 3.1.1 batch jobs against both Hadoop and Hive tables

Re: [VOTE] Adopt the v2 spec changes

2021-07-28 Thread Kyle Bendickson
+1 (non-binding)  Kyle Bendickson Software Engineer Apple ACS Data One Apple Park Way, Cupertino, CA 95014, USA kbendick...@apple.com This email and any attachments may be privileged and may contain confidential information intended only for the recipient(s) named above. Any other

Re: Iceberg 0.12 release ETA

2021-06-16 Thread Kyle Bendickson
Actually, according to the link I just sent the ETA is currently early July for 0.12. - Apologies, Kyle On Wed, Jun 16, 2021 at 2:24 AM Kyle Bendickson wrote: > Additionally, I don’t believe a firm date for the 0.12 release has been > brought up as there is on going work on the v2

Re: iceberg code style

2021-06-16 Thread Kyle Bendickson
This would be good content for the wiki. I believe I have some notes / screenshots for setting this up in IntelliJ that I would be happy to share. I’ll reach out to Carl to see how that might be possible. I will also look for these notes on my laptop tomorrow and send them in the thread if need

Re: Iceberg 0.12 release ETA

2021-06-16 Thread Kyle Bendickson
Additionally, I don’t believe a firm date for the 0.12 release has been brought up as there is on going work on the v2 format. Somebody please correct me if I'm wrong! - Kyle On Wed, Jun 16, 2021 at 2:23 AM Kyle Bendickson wrote: > Hi Justin, > > Unfortunately, the sync has no

Re: Iceberg 0.12 release ETA

2021-06-16 Thread Kyle Bendickson
s are around that but happy to help where I can to ensure the project is meeting its goals and requirements while remaining compliant however required. All the best, Kyle Bendickson, @kbendick on GitHub OSS Developer at Apple kjbendick...@gmail.com / kbendickson[at] apple[dot]com On Tue, Jun 15, 2021

Re: Next community sync

2021-05-25 Thread Kyle Bendickson
Hi Ryan, Can you please add my new work email to the community sync? kbendickson [at] apple [dot ]com Thanks, Kyle!  Kyle Bendickson Software Engineer Apple ACS Data One Apple Park Way, Cupertino, CA 95014, USA kbendick...@apple.com <mailto:kbendick...@apple.com> This email a

Re: Several flink pull requests need to get merged before the next release 0.10.0

2020-10-29 Thread Kyle Bendickson
Oops spoke too soon. Looks like they've all been merged.and I approved the last one /shrug. Sorry for the late night email response everyone. - Kyle On Thu, Oct 29, 2020 at 12:22 AM Kyle Bendickson wrote: > I will go through and re-review all of these PRs over the next two days > Zh

PR to switch to Github Actions and fix the PR Labeler

2020-10-29 Thread Kyle Bendickson
g/jira/browse/SPARK-33282 Thanks Kyle Bendickson Github: @kbendick

Re: Several flink pull requests need to get merged before the next release 0.10.0

2020-10-29 Thread Kyle Bendickson
I will go through and re-review all of these PRs over the next two days Zheng to help get these merged asap. - Kyle @kbendick On Tue, Oct 27, 2020 at 1:30 AM OpenInx wrote: > Hi Ryan > > Is it the correct time once we get the PR 1477 merged ? Do we have any > other blockers for the coming rele