Hi everyone,

It’s time for another board report from Iceberg. I’ve gone ahead and prepared a draft. If you have anything to add, please reply with updates.

Ryan

Description:

Apache Iceberg is a table format for huge analytic datasets that is designed for high performance and ease of use.

Issues:

There are no issues requiring board attention.

Membership Data:

Apache Iceberg was founded 2020-05-19 (2 years ago)
There are currently 18 committers and 12 PMC members in this project.
The Committer-to-PMC ratio is 3:2.

Community changes, past quarter:
- Jack Ye was added to the PMC on 2021-11-14
- Russell Spitzer was added to the PMC on 2021-11-13
- No new committers. Last addition was Jack Ye on 2021-07-02.

Project Activity:

0.12.1 was released on 2021-11-08. The community is also working on the next release, 0.13.0.
- A spec for table branching and tagging was written and is nearing completion.
- Iceberg’s documentation is being updated so that multiple versions can be easily maintained and updated.
- Delete file compaction was added to the rewrite files action and stored procedure. Additional compaction options are planned.
- Sort-based compaction was added.
- Flink and Spark plugins have been refactored so that each version is independent and is compiled against the correct engine version. While this duplicates some code, it makes integrating new features easier and reduces the risk of runtime incompatibilities.
- Support was added for Flink 1.14.x and Spark 3.2.x.
- A REST catalog API spec is taking shape. This should standardize an interface for providing a table catalog, similar to the Thrift metastore interface used by Hive.
- Aliyun OSS support was added as an IO module.
- The community decided on goals for a 1.0 release, targeted for early next year.
- The Python implementation is making progress.

Community Health:

Community metrics show healthy growth. Notably, there were 66 unique contributors this quarter, up from 50 last quarter. More than 750 PRs were submitted, about 50% more than the 500 last quarter. Similarly, PRs closed increased to 682 from about 400 last quarter, a 64% increase.
The most significant stat is the increase in unique contributors, which signals that more people are interested in the project.

This quarter, there were talks featuring Iceberg at AWS re:Invent (where Athena announced support), Trino Summit, and community events for PrestoDB, lakeFS, and SF Big Analytics.

--
Ryan Blue