Is the current roadmap published somewhere? I went to Confluence and
couldn't find anything.

Patrick

On Tue, Apr 22, 2025 at 10:53 AM Doug Rohrer <droh...@apple.com> wrote:

> Hello folks,
>
> As many of you on the ASF Slack may have noticed, I’ve been creating a
> bunch of new tickets for the Cassandra Analytics project related to a 1.0
> release. Since it was initially contributed, there have been many
> enhancements and fixes to the library, but there are still some gaps that
> need to be addressed. We’re putting together a plan to close those gaps,
> and would love to enlist more folks from the community in making the
> analytics library more useful. The gaps we see today include:
>
>    - vnode support (and optimizations to the exiting code if necessary to
>    make it work more efficiently with clusters using vnodes) (
>    CASSANALYTICS-10
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-10>)
>    - Cassandra 5.0 support (this is an epic with lots of subtasks, some
>    of which are already being worked on by a variety of folks) (
>    CASSANALYTICS-23
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-23>)
>    - Documentation, including both docs on cassandra.apache.org and
>    updated/improved developer docs in the repository itself (
>    CASSANALYTICS-6 <https://issues.apache.org/jira/browse/CASSANALYTICS-6>
>    )
>    - Build scripts for release (CASSANALYTICS-22
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-22>)
>    - Miscellaneous bug fixes of known issues/improvements
>       - Analytics writer should support all valid partition/clustering
>       key types (CASSANALYTICS-35
>       <https://issues.apache.org/jira/browse/CASSANALYTICS-35>)
>       - CassandraDataLayer uses configuration list of IPs instead of the
>       full ring/datacenter (CASSANALYTICS-20
>       <https://issues.apache.org/jira/browse/CASSANALYTICS-20>)
>       - Bulk Reader should dynamically calculate number of cores to use
>       to better utilize resources for smaller tables (CASSANALYTICS-36
>       <https://issues.apache.org/jira/browse/CASSANALYTICS-36>)
>
>
> Beyond 1.0, there’s a lot of improvements and enhancements on the roadmap
> to date:
>
>    - Cassandra 6.0 Support (CASSANALYTICS-37
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-37>)
>    - Spark 4.0 support (CASSANALYTICS-34
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-34>)
>    - JDK Support Matrix (CASSANALYTICS-38
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-38>)
>    - Improved Compaction/Repair load for bulk writes (CASSANALYTICS-39
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-39>)
>    - Bandwidth reduction (especially cross-dc writes) (CASSANALYTICS-40
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-40>)
>    - Consolidation of SBW-on-S3 and DIRECT mode code (CASSANALYTICS-41
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-41>)
>    - Bulk reads via S3 (CASSANALYTICS-42
>    <https://issues.apache.org/jira/browse/CASSANALYTICS-42>)
>
>
> We’re also looking for input on what others think should be in the 1.0
> release, or the long-term roadmap. If you’ve got ideas, don’t hesitate to
> respond to this thread. I’ll also be checking the existing JIRAs and making
> sure they are incorporated into the plan, which I believe most are already.
>
> I want to thank the folks who have, so far, contributed most of the code
> for the Analytics library, and those in the community who have already
> started to use and improve it. We’re looking forward to getting more
> community members involved. If any of these items sounds interesting,
> please feel free to reach out to folks on Slack or reply on the dev list.
>
> Thanks,
>
> Doug Rohrer
>

Reply via email to