Hi Ryan,

Thanks for putting together the report!

I have a couple of items that might be helpful to include.

PyIceberg
* Updated the release process and documentation
* Updated integration tests to use the TCK REST catalog docker image
(`apache/iceberg-rest-fixture`) built using the apache/iceberg repo
* Added manifest files caching
* Added support for all metadata tables via the Table Inspect API
* Added support for High Availability mode for Hive Metastore
* Removed `numpy` as a hard dependency
* PyIceberg crossed 100k daily downloads on PyPI (
https://pypistats.org/packages/pyiceberg)

Community
* Several recurring community meetups have started in
Seattle/SF/Singapore, and more are planned
* Meetup presentations are recorded and available on the Apache Iceberg
Meetup YouTube channel (https://www.youtube.com/@IcebergMeetup)

Best,
Kevin Liu

On Wed, Dec 11, 2024 at 10:30 PM Péter Váry <peter.vary.apa...@gmail.com>
wrote:

> Hi Ryan,
> Thanks for putting this together!
> For Java/Flink we could mention that ExpireSnapshots TableMaintenance is
> available now.
>
> On Thu, Dec 12, 2024, 04:47 Ajantha Bhat <ajanthab...@gmail.com> wrote:
>
>> On the Java side, I would add
>>
>> - Core util to compute partition stats has been merged.
>> https://github.com/apache/iceberg/pull/11146
>>
>> - REST catalog TCK has been merged and the docker image is published under
>> `apache/iceberg-rest-fixture`
>>
>> Also,
>>
>>>  Spark: Removed Spark 3.3 support
>>
>> We just deprecated it after 1.7.0, so there will be one last release in
>> 1.8.0.
>> Maybe we can reword as Deprecated instead of removed as the code is still
>> there.
>>
>> - Ajantha
>>
>> On Thu, Dec 12, 2024 at 7:25 AM Renjie Liu <liurenjie2...@gmail.com>
>> wrote:
>>
>>> For Rust, we have added support for a Parquet data file writer, and
>>> support for other writers is in progress.
>>>
>>>
>>> On Thu, Dec 12, 2024 at 9:26 AM Gang Wu <ust...@gmail.com> wrote:
>>>
>>>> For C++, I think the aim is a full-featured C++ library (not only a
>>>> Puffin implementation).
>>>>
>>>> On Thu, Dec 12, 2024 at 6:14 AM rdb...@gmail.com <rdb...@gmail.com>
>>>> wrote:
>>>>
>>>>> I'll update it. Thanks!
>>>>>
>>>>> (By the way, the Avro default value support was in the Java section)
>>>>>
>>>>> On Wed, Dec 11, 2024 at 2:00 PM Matt Topol <zotthewiz...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> For the Go release, can we please point out that it supports reading
>>>>>> the data too, not just metadata?
>>>>>>
>>>>>> It produces a stream of Arrow record batches.
>>>>>>
>>>>>> On Wed, Dec 11, 2024, 4:22 PM Walaa Eldin Moustafa <
>>>>>> wa.moust...@gmail.com> wrote:
>>>>>>
>>>>>>> Hi Ryan,
>>>>>>>
>>>>>>> For Table Format V3, we could point out that the default value
>>>>>>> support for Avro has been merged and support for other formats is 
>>>>>>> ongoing.
>>>>>>>
>>>>>>> Thanks,
>>>>>>> Walaa.
>>>>>>>
>>>>>>>
>>>>>>> On Wed, Dec 11, 2024 at 12:51 PM rdb...@gmail.com <rdb...@gmail.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Hi everyone,
>>>>>>>>
>>>>>>>> It’s time to report to the board again. Great to see all the
>>>>>>>> progress here, and awesome to have our first Go release this quarter!
>>>>>>>>
>>>>>>>> My draft is below. Please reply if there’s anything you’d like to
>>>>>>>> add or change. Thanks!
>>>>>>>>
>>>>>>>> Ryan
>>>>>>>> Description:
>>>>>>>>
>>>>>>>> Apache Iceberg is a table format for huge analytic datasets that is
>>>>>>>> designed
>>>>>>>> for high performance and ease of use.
>>>>>>>> Project Status:
>>>>>>>>
>>>>>>>> Current project status: Ongoing
>>>>>>>> Issues for the board: None
>>>>>>>> Membership Data:
>>>>>>>>
>>>>>>>> Apache Iceberg was founded 2020-05-19 (5 years ago)
>>>>>>>> There are currently 32 committers and 21 PMC members in this
>>>>>>>> project.
>>>>>>>> The Committer-to-PMC ratio is roughly 3:2.
>>>>>>>>
>>>>>>>> Community changes, past quarter:
>>>>>>>>
>>>>>>>>    - No new PMC members. Last addition was Amogh Jahagirdar on
>>>>>>>>    2024-08-12.
>>>>>>>>    - Matthew Topol was added as committer on 2024-12-09
>>>>>>>>    - Scott Donnelly was added as committer on 2024-12-10
>>>>>>>>
>>>>>>>> Project Activity:
>>>>>>>>
>>>>>>>> Releases
>>>>>>>>
>>>>>>>>    - 1.7.1 was released on 2024-12-06.
>>>>>>>>    - 1.7.0 was released on 2024-11-08.
>>>>>>>>    - PyIceberg 0.8.1 was released on 2024-12-06.
>>>>>>>>    - PyIceberg 0.8.0 was released on 2024-11-18.
>>>>>>>>    - Go 0.1.0 was released on 2024-11-18.
>>>>>>>>
>>>>>>>> Table format (v3)
>>>>>>>>
>>>>>>>>    - Added deletion vectors and synchronous maintenance to improve
>>>>>>>>    row-level ops
>>>>>>>>    - Added row lineage fields and requirements for fine-grained
>>>>>>>>    row tracking
>>>>>>>>    - Proposal for geography and geometry types is close to
>>>>>>>>    consensus
>>>>>>>>    - Update to add Parquet’s variant type is approved, waiting on
>>>>>>>>    Parquet upstream
>>>>>>>>    - Finalized new type promotion rules
>>>>>>>>
>>>>>>>> Puffin format
>>>>>>>>
>>>>>>>>    - Added deletion vector blob type to support DVs in tables
>>>>>>>>
>>>>>>>> REST catalog spec
>>>>>>>>
>>>>>>>>    - Added storage credentials passing
>>>>>>>>    - Added credential refresh
>>>>>>>>    - Created a docker image for catalog testing
>>>>>>>>    - Discussing proposal for partial metadata commits
>>>>>>>>    - Discussed partial metadata loading
>>>>>>>>
>>>>>>>> Views
>>>>>>>>
>>>>>>>>    - Discussions about materialized view metadata are ongoing
>>>>>>>>
>>>>>>>> Java
>>>>>>>>
>>>>>>>>    - Released new Kafka Connect sink
>>>>>>>>    - Added default values implementation for Avro
>>>>>>>>    - Added nanosecond timestamps
>>>>>>>>    - Added v3 DV support in core, ongoing work in Spark
>>>>>>>>    - Flink: Made FLIP-27 source the default
>>>>>>>>    - Spark: Removed Spark 3.3 support
>>>>>>>>    - Hive: Removing Hive 2.x and 3.x (Iceberg support is in Hive
>>>>>>>>    for 4.x and on)
>>>>>>>>    - Pig: Removed the iceberg-pig module that is no longer used
>>>>>>>>
>>>>>>>> PyIceberg
>>>>>>>>
>>>>>>>>    - Support: Added Python 3.12, dropped Python 3.8
>>>>>>>>
>>>>>>>> Rust
>>>>>>>>
>>>>>>>>    - Support for default values and type promotion in reads
>>>>>>>>    - Added TableMetadataBuilder
>>>>>>>>    - Implemented table requirements
>>>>>>>>
>>>>>>>> Go
>>>>>>>>
>>>>>>>>    - Produced the first Go release!
>>>>>>>>    - Supports scan planning and reading metadata
>>>>>>>>    - Supports loading and listing tables with the Glue catalog
>>>>>>>>    - Supports local and S3 storage
>>>>>>>>
>>>>>>>> C++
>>>>>>>>
>>>>>>>>    - Added a C++ repository for a Puffin implementation
>>>>>>>>
>>>>>>>> Community Health:
>>>>>>>>
>>>>>>>> The PMC has published guidelines for contributors that want to know
>>>>>>>> more about
>>>>>>>> how they can become committers on the Iceberg site. This guide
>>>>>>>> should help
>>>>>>>> contributors understand how Iceberg and other ASF communities
>>>>>>>> decide and add
>>>>>>>> committers, and should set expectations clearly. This was the most
>>>>>>>> important
>>>>>>>> follow up from discussions on the dev list earlier this year, where
>>>>>>>> it became
>>>>>>>> clear that contributors did not understand the requirements or
>>>>>>>> process.
>>>>>>>>
>>>>>>>> The community has started planning a second Iceberg Summit,
>>>>>>>> intended to be held
>>>>>>>> in Spring of 2025. The proposal details are being finalized (such
>>>>>>>> as the members
>>>>>>>> of the selection committee) and will be submitted for approval in
>>>>>>>> the next few
>>>>>>>> weeks.
>>>>>>>>
>>>>>>>> The community added two new committers this quarter and had a
>>>>>>>> slight increase in
>>>>>>>> the number of contributors.
>>>>>>>>
>>>>>>>> There were also a number of commercial announcements from companies
>>>>>>>> adding or
>>>>>>>> expanding support for Iceberg.
>>>>>>>>
>>>>>>>
