Hi Karthick, To share a few thoughts on this - I guess a lot depends on your actual setup / requirements, such as:
- The sort of potential outages you really need to tackle with this - The volume of data you have to continuously process - Constraints you may have in terms of messages that could be(come) duplicated or out-of-order in cases of outages. At least that will likely happen at the boundaries where you switch to / back from a secondary backup storage, and in the process of replaying data from that backup storage back to Kafka. For certain kinds of network outages for example, your backup solution could fail the same way as the access to Kafka - so you might not be able to connect to Cassandra either. At a previous job, we used a local on-disk storage buffer for specific (critical) producers that couldn't afford even short outages in the connection to Kafka. Producers would then replay buffered data from disk back to Kafka once it came back up. That is of course just 'local' handling (and limited by available disk-space), but it doesn't introduce a dependency on another external system as backup. If you are however certain to need another (external) system to serve as backup, why not just use another (fully separate) Kafka cluster as your backup, too? Using Kafka for this again seems only natural to me. Introducing a fully different technology seems like overkill when it's really just another message buffer you need - plus a different tech-stack can have its own requirements and challenges to maintain. And as added benefit, any tooling you may have to observe your actual Kafka cluster could be used in similar ways also for data in the backup cluster (even if that's ideally just empty). Best, Ralph On Fri, Mar 8, 2024 at 12:54 PM Karthick <ibmkarthickma...@gmail.com> wrote: > Hi guys, I'm still seeking for a solution, I hope many face this problem in > production. Any working solutions will help us. > > On Sat, Feb 17, 2024 at 2:16 PM Karthick <ibmkarthickma...@gmail.com> > wrote: > > > Dear Kafka Community, > > > > I am reaching out to seek your valuable feedback and insights on a > > proposed solution we are considering for managing Kafka outages using > > Cassandra. > > > > At our organization, we heavily rely on Kafka for real-time data > > processing and messaging. However, like any technology, Kafka is > > susceptible to occasional outages which can disrupt our operations and > > impact our services. To mitigate the impact of such outages and ensure > > continuity, we are exploring the possibility of leveraging Cassandra as a > > backup solution. > > > > Our proposed approach involves storing messages in Cassandra during Kafka > > outages. Subsequently, we plan to implement a scheduler that will read > from > > Cassandra and attempt to write these messages back into Kafka once it is > > operational again. > > > > We believe that by adopting this strategy, we can achieve the following > > benefits: > > > > 1. > > > > Improved Fault Tolerance: By having a backup mechanism in place, we > > can reduce the risk of data loss and ensure continuity of operations > during > > Kafka outages. > > 2. > > > > Enhanced Reliability: Cassandra's distributed architecture and > > built-in replication features make it well-suited for storing data > > reliably, even in the face of failures. > > 3. > > > > Scalability: Both Cassandra and Kafka are designed to scale > > horizontally, allowing us to handle increased loads seamlessly. > > > > Before proceeding further with this approach, we would greatly appreciate > > any feedback, suggestions, or concerns from the community. Specifically, > we > > are interested in hearing about: > > > > - Potential challenges or drawbacks of using Cassandra as a backup > > solution for Kafka outages. > > - Best practices or recommendations for implementing such a backup > > mechanism effectively. > > - Any alternative approaches or technologies that we should consider? > > > > Your expertise and insights are invaluable to us, and we are eager to > > learn from your experiences and perspectives. Please feel free to share > > your thoughts or reach out to us with any questions or clarifications. > > > > Thank you for taking the time to consider our proposal, and we look > > forward to hearing from you soon. > > Thanks and regards, > > Karthick > > >