Hi everyone, Here's my current draft board report for June. If you have anything to add or update, please reply and I'll amend the report.
Thanks, Ryan ## Description: Apache Iceberg is a table format for huge analytic datasets that is designed for high performance and ease of use. ## Project Status: Current project status: Ongoing Issues for the board: None ## Membership Data: Apache Iceberg was founded 2020-05-19 (4 years ago) There are currently 27 committers and 16 PMC members in this project. The Committer-to-PMC ratio is roughly 7:4. Community changes, past quarter: - No new PMC members. Last addition was Szehon Ho on 2023-04-20. - No new committers. Last addition was Renjie Liu on 2024-03-06. ## Project Activity: Releases: - 1.5.1 was released on 2024-04-25 - 1.5.2 was released on 2024-05-09 - PyIceberg 0.6.1 was released on 2024-04-30 PyIceberg: - Contributors are working to release more often - Improved retries for Hive catalog locking - Added register table support for Glue catalogs - Adding metadata table support (snapshots, manifests, etc.) - Working toward 0.7.0 release with partitioned writes and staged table creation Rust: - Implemented projection to support partition-based file pruning - Implemented the inclusive metrics evaluator and predicate pushdown to Parquet - Added Hive catalog support - Improved REST catalog with OAuth2 and custom headers - Added integration with DataFusion Go: - Working toward full expression support; added literals Iceberg Java: - The next Java release, 1.6.0, is targeted for release in June Specs: - Discussions about standardizing metadata for materialized views have made good progress. The community decided to use existing objects rather than creating a new combined table/view object and is working on metadata details. - An extension to the REST protocol for privilege GRANT and REVOKE operations was proposed. - Many discussions for extending the REST protocol are ongoing, including adding routes to plan scans, adding auth decisions, and appending data files - There are also discussions for v3 features, like additional types (variant, timestampns, and others) ## Community Health: The Iceberg community continues to be healthy, with a large number of commits and individual contributors over the past quarter. Although overall commits decreased, the change corresponds with the number of opened PRs so the change is not a concern for health; PRs are getting reviewed. The community is formalizing design discussions and has added github labels and documented a process for making changes to community specs. The community also held the first Iceberg Summit this quarter, with 32 sessions that are now available on the YouTube (https://tinyurl.com/iceberg-summit). Community members also spoke at CoC EU. A company that employs 3 PMC members and 2 committers was acquired. The PMC members (2 of whom are ASF members) have been reminded to act as individuals, not as representatives of their employer, when interacting in the community. Concentrations of PMC members is a risk that the community is aware of and will note in future board reports. Other projects and announcements: - Trino added support for Iceberg views - Beam has added an Iceberg sink - Confluent, Terradata, and Oracle announced Iceberg support - Snowflake announced a new open source REST catalog project - Databricks released its catalog that implements the REST spec -- Ryan Blue Tabular