[
https://issues.apache.org/jira/browse/CASSANALYTICS-50?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17950375#comment-17950375
]
Andrew Johnson commented on CASSANALYTICS-50:
---------------------------------------------
CI run
[https://app.circleci.com/pipelines/github/anderoo/cassandra-analytics/5/workflows/a6be5c5a-4002-4bf1-a44f-7ddb2984d5aa]
!Screenshot 2025-05-08 at 23.04.00.png!
> [Analytics] Add support for vnodes
> ----------------------------------
>
> Key: CASSANALYTICS-50
> URL: https://issues.apache.org/jira/browse/CASSANALYTICS-50
> Project: Apache Cassandra Analytics
> Issue Type: Improvement
> Components: Reader, Writer
> Reporter: James Berragan
> Assignee: Andrew Johnson
> Priority: Normal
> Attachments: Screenshot 2025-05-08 at 23.03.49.png, Screenshot 2025-05-08 at 23.04.00.png
>
> Time Spent: 5h 20m
> Remaining Estimate: 0h
>
> Analytics currently assumes one token per node, but most people in the
> community use vnodes by default. We should add support for vnodes. At the
> most basic level, we can issue one Spark task per vnode range. After that,
> optimizations can be added to merge contiguous ranges more intelligently,
> or to sort tasks so that the Spark JVM avoids connecting to the entire cluster.
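
The range-merging optimization mentioned in the description could be sketched roughly as follows. This is only an illustration, not the Cassandra Analytics implementation: `TokenRange`, its `owner` field, and `merge_contiguous` are hypothetical names, ranges are modeled as (start, end] on plain integers, and ring wraparound is ignored. Before merging, each vnode range would map to one Spark task; after merging, adjacent ranges held by the same owner collapse into a single task.

```python
from typing import List, NamedTuple


class TokenRange(NamedTuple):
    """Hypothetical model of a token range (start, end] held by one owner."""
    start: int
    end: int
    owner: str


def merge_contiguous(ranges: List[TokenRange]) -> List[TokenRange]:
    """Merge adjacent ranges that share an owner and touch end-to-start.

    Assumption: ranges do not overlap and do not wrap around the ring.
    """
    merged: List[TokenRange] = []
    for current in sorted(ranges, key=lambda r: r.start):
        last = merged[-1] if merged else None
        if last is not None and last.owner == current.owner and last.end == current.start:
            # Extend the previous range instead of creating a new task.
            merged[-1] = TokenRange(last.start, current.end, current.owner)
        else:
            merged.append(current)
    return merged


# Four vnode ranges would mean four tasks in the basic scheme.
vnode_ranges = [
    TokenRange(0, 10, "node-a"),
    TokenRange(10, 20, "node-a"),
    TokenRange(20, 30, "node-b"),
    TokenRange(30, 40, "node-a"),
]
tasks = merge_contiguous(vnode_ranges)
```

Here the first two ranges share an owner and are contiguous, so they would be read by one task; the remaining two stay separate because the owner changes between them.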