I am sorry -- I seem to have edited the wrong document I updated the google doc https://docs.google.com/document/d/10MrrMHyfALB9Vpjtl-6abGWdNz6VLiWR9yI1a6Yl1aw/edit with the intended content and copy/pasted it below as well
## Description: The mission of Apache DataFusion is the creation and maintenance of software related to an extensible query engine ## Project Status: Current project status: New + Ongoing (high activity) Issues for the board: None ## Membership Data: Apache DataFusion was founded 2024-04-16 (4 months ago) There are currently 34 committers and 14 PMC members in this project. The Committer-to-PMC ratio is roughly 9:4. Community changes, past quarter: - Jay Zhan was added to the PMC on 2024-08-11 - Mehmet Ozan Kabak was added to the PMC on 2024-06-12 - Ruihang Xia was added to the PMC on 2024-06-12 - Huaxin Gao was added as committer on 2024-05-23 - Kazuyuki Tanimura was added as committer on 2024-05-26 - Eduard Karacharov was added as committer on 2024-08-14 - Lewis Zhang was added as committer on 2024-06-14 ## Project Activity: The project continues to be active with many PRs and issues opened and closed per day. We wrote two public blogs about our work: [1], [2] and DataFusion and systems built on it are being featured in high profile (for the Database world) venues such as the [CMU Database Systems Seminar] [1]: https://datafusion.apache.org/blog/2024/07/24/datafusion-40.0.0/ [2]: https://datafusion.apache.org/blog/2024/07/20/datafusion-comet-0.1.0/ [CMU Database Systems Seminar]: https://db.cs.cmu.edu/seminar2024/ We are working to [adopt the sqlparser crate] into the project as well [adopt the sqlparser crate]: https://github.com/sqlparser-rs/sqlparser-rs/issues/1294 ### DataFusion core https://github.com/apache/datafusion We continue the monthly release cadence versions 40.0.0 and 41.0.0 and are on track for version 42.0.0. The [41.0.0 release] had almost 70 unique contributors. We are currently focused on performance including for high cardinality aggregates and adding support for StringViewArrays. We completed a long running project to ensure all aggregate functions use the same API and are beginning the same project for window functions. There is work to add LogicalTypes, as well as to create a more differentiated CLI experience. See the [roadmap ticket] for more details. [41.0.0 release]: https://github.com/apache/datafusion/blob/main/dev/changelog/41.0.0.md [roadmap ticket]: https://github.com/apache/datafusion/issues/11442 ### Sub project: DataFusion Python https://github.com/apache/datafusion-python The DataFusion Python project has received significant contributions recently to make the project more “Pythonic” and now has regular activity from maintainers. ### Sub project: DataFusion Comet https://github.com/apache/datafusion-comet The Comet project is very active and recently released its initial 0.1.0 source release. Blog post: https://datafusion.apache.org/blog/2024/07/20/datafusion-comet-0.1.0/ ### Sub project: DataFusion Ballista https://github.com/apache/datafusion-ballista https://github.com/apache/datafusion-ballista-python The Ballista subproject is not very actively maintained, but there have been some contributions recently to upgrade to more recent versions of the core DataFusion project. ### Recent Releases * PYTHON-39.0.0 was released on 2024-07-02. * 39.0.0 was released on 2024-06-10. * PYTHON-38.0.1 was released on 2024-05-30. * PYTHON-37.1.0 was released on 2024-05-13. * 38.0.0 was released on 2024-05-10. ## Community Health: It is still hard to keep track of everything going on these days, which is a good thing. While it is always a struggle to get enough code review, the committers keep things going and the community helps each other out with reviews. We continue to actively grow our committer and PMC ranks. There are currently three meetups planned: New York City, San Francisco (for the second time!) and Belgrade. On Tue, Aug 20, 2024 at 2:34 PM Andrew Lamb <al...@influxdata.com> wrote: > It is time to provide another board report. This is the start of our > quarterly reports so after this we'll have only reports each quarter > > I also think the primary reason for these reports is for the board to make > sure the project is ok. The actual project details are more likely helpful > for other members of the project than the board. > > With that being said, please feel free to provide your comments on the > ticket[1], google doc[2] or reply to this email and I will incorporate them > > I plan to submit this to the board on September 11, 2024 > > Thanks, > Andrew > > [1]: https://github.com/apache/datafusion/issues/10156 > [2]: > https://docs.google.com/document/d/10MrrMHyfALB9Vpjtl-6abGWdNz6VLiWR9yI1a6Yl1aw/edit > > ----- > > > ## Description: > > The mission of Apache DataFusion is the creation and maintenance of > software > > related to an extensible query engine > > ## Project Status: > > Current project status: New + Ongoing (high activity) > > Issues for the board: None > > > ## Membership Data: > > Apache DataFusion was founded 2024-04-16 (3 months ago) > > There are currently 33 committers and 13 PMC members in this project. > > The Committer-to-PMC ratio is roughly 9:4. > > Community changes, past month: > > - Mehmet Ozan Kabak was added to the PMC on 2024-06-12 > > - Ruihang Xia was added to the PMC on 2024-06-12 > > - Lewis Zhang was added as committer on 2024-06-14 > > > ## Project Activity: > > The project continues to be quite active with many PRs and issues opened > and > > closed per day. > > We started working on a project blog [1] (previously we used the arrow > blog) > > and hope to have our first blog post as an independent project later this > > month. > > There was a well attended face to face meetup in San Francisco, CA USA in > June > > [2]. We have one planned for Hangzhou, China in July[3]. There appears > > significant interest in these events and there are at least 2 more planned > > for September in New York, NY USA and in Belgrade, Serbia > > The community around DataFusion is growing too. For example, Spice AI has > made > > an initial contribution of TableProviders to datafusion-contrib [4] for > > PostgreSQL, MySQL, DuckDB, and SQLite, enabling these data sources to be > > easily queried through DataFusion. > > [1]: https://datafusion.apache.org/blog/ > > [2]: https://github.com/apache/datafusion/discussions/10800 > > [3]: https://github.com/apache/datafusion/discussions/10341 > > #discussioncomment-9738748 > > [4]: https://github.com/datafusion-contrib/datafusion-table-providers > > ### DataFusion core > > https://github.com/apache/datafusion > > We released version 39.0.0, continuing our schedule of monthly releases and > > are on track to release version 40.0.0 in the next day or two. > > Some projects we have been working on recently involve adding support for > more > > flexible use of Parquet files including indexing and extracting > statistics. We > > are also working with the community to make extending SQL planning[2] > easier > > and extending file format support[3], as well as fixing bugs found with a > SQL > > fuzzer[4], and improving performance with StringView[5]. > > It has been nice to see several good examples of cross contributor/company > > collaboration such as [6] and [7]. > > We have also been making external presentations[1] > > [1]: https://github.com/apache/datafusion/issues/10969 > > [2]: https://github.com/apache/datafusion/issues/10534 > > [3]: https://github.com/apache/datafusion/pull/11060 > > [4]: https://github.com/apache/datafusion/issues/11030 > > [5]: https://github.com/apache/datafusion/issues/10918 > > [6]: https://github.com/apache/datafusion/pull/11203 > > [7]: https://github.com/apache/datafusion/issues/10534 > > ### Sub project: DataFusion Python > > https://github.com/apache/datafusion-python > > The DataFusion Python project continues to receive updates as new versions > of > > the core DataFusion crate are released. There have also been some minor > > improvements to improve user experience. > > > ### Sub project: DataFusion Comet > > https://github.com/apache/datafusion-comet > > The Comet project is very active and is working towards an initial 0.1.0 > > source release. Initial benchmark results were published to > > https://datafusion.apache.org/comet/contributor-guide/benchmarking.html. > > > ### Sub project: DataFusion Ballista > > https://github.com/apache/datafusion-ballista > > https://github.com/apache/datafusion-ballista-python > > The Ballista subproject is not very actively maintained, but there have > been > > some contributions recently to upgrade to more recent versions of the core > > DataFusion project. > > ### Recent Releases > > * PYTHON-39.0.0 was released on 2024-07-02. > > * 39.0.0 was released on 2024-06-10. > > * PYTHON-38.0.1 was released on 2024-05-30. > > * PYTHON-37.1.0 was released on 2024-05-13. > > * 38.0.0 was released on 2024-05-10. > > > ## Community Health: > > Community health is good -- we recently hit the 600 total contributors > mark according to Github. This number is partially inflated from initially > being part of the Arrow mono repo but the trend is healthy non the less. > > It is hard to keep track of everything going on these days, which is a > good thing. While it is always a struggle to get enough code review, the > committers keep things going and the community helps each other out with > reviews. > > > > > >