[jira] [Updated] (CASSSIDECAR-341) Implement Durable State for Long-Running Sidecar Jobs

Jira Wed, 19 Nov 2025 14:24:07 -0800


     [ 
https://issues.apache.org/jira/browse/CASSSIDECAR-341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Andrés Beck-Ruiz updated CASSSIDECAR-341:
-----------------------------------------
      Epic Link: CASSSIDECAR-370
    Description: 
The Sidecar's job management framework 
(https://issues.apache.org/jira/browse/CASSSIDECAR-150) currently [stores job 
state 
in-memory|https://github.com/apache/cassandra-sidecar/blob/0ad3e6fe545abc9254a88bd8de15f24af27d68c7/server/src/main/java/org/apache/cassandra/sidecar/job/OperationalJobTracker.java#L49].
 This approach makes some assumptions around the duration of the job and is not 
resilient, as all job information is lost if the Sidecar restarts.

To support long-running operations like `repair` 
(https://issues.apache.org/jira/browse/CASSSIDECAR-268) that may take days to 
complete, or cluster-wide operations such as a rolling restart, we must make 
the job state durable. 

This ticket will track all the subtasks needed to enable the current 
operational job framework to store operational job state in persistent storage. 

  was:
The Sidecar's job management framework 
(https://issues.apache.org/jira/browse/CASSSIDECAR-150) currently stores job 
state in-memory. This approach makes some assumptions around the duration of 
the job and is not resilient, as all job information is lost if the Sidecar 
restarts.

To support long-running operations like `repair` 
(https://issues.apache.org/jira/browse/CASSSIDECAR-268) that may take days to 
complete, we must make the job state durable. This change will introduce a 
persistent storage layer, ensuring that the status and progress of asynchronous 
jobs survive restarts.


> Implement Durable State for Long-Running Sidecar Jobs
> -----------------------------------------------------
>
>                 Key: CASSSIDECAR-341
>                 URL: https://issues.apache.org/jira/browse/CASSSIDECAR-341
>             Project: Sidecar for Apache Cassandra
>          Issue Type: Improvement
>            Reporter: Arjun Ashok
>            Priority: Major
>              Labels: RollingRestart
>
> The Sidecar's job management framework 
> (https://issues.apache.org/jira/browse/CASSSIDECAR-150) currently [stores job 
> state 
> in-memory|https://github.com/apache/cassandra-sidecar/blob/0ad3e6fe545abc9254a88bd8de15f24af27d68c7/server/src/main/java/org/apache/cassandra/sidecar/job/OperationalJobTracker.java#L49].
>  This approach makes some assumptions around the duration of the job and is 
> not resilient, as all job information is lost if the Sidecar restarts.
> To support long-running operations like `repair` 
> (https://issues.apache.org/jira/browse/CASSSIDECAR-268) that may take days to 
> complete, or cluster-wide operations such as a rolling restart, we must make 
> the job state durable. 
> This ticket will track all the subtasks needed to enable the current 
> operational job framework to store operational job state in persistent 
> storage. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Updated] (CASSSIDECAR-341) Implement Durable State for Long-Running Sidecar Jobs

Reply via email to