Pritam Kumar created KAFKA-19211:
------------------------------------

             Summary: Connect Storage Topics Sharing Across Clusters
                 Key: KAFKA-19211
                 URL: https://issues.apache.org/jira/browse/KAFKA-19211
             Project: Kafka
          Issue Type: Improvement
          Components: connect
            Reporter: Pritam Kumar


Setting up a *Kafka Connect* cluster requires provisioning three internal 
topics:
 # *Offset Storage Topic* – Tracks offsets of source connectors.
 # *Status Storage Topic* – Maintains the state of connectors and tasks.
 # *Config Storage Topic* – Stores connector configurations.

This *design choice* simplifies migration—enabling seamless replication of 
management topics across regions. However, it introduces {*}operational 
overhead{*}:
 * {*}Every new cluster requires three new topics{*}, leading to an exponential 
increase in topic creation.
 * {*}Cross-team dependencies slow down provisioning{*}, delaying deployments.
 * {*}Compacted topics trigger frequent disk cleanup operations{*}, adding 
maintenance complexity.

While each cluster only requires {*}three topics{*}, their cumulative impact 
grows significantly as more connect clusters are deployed.

But as these topics have very light traffic and are compacted, instead of 
provisioning dedicated topics for every cluster, Kafka Connect clusters can 
*share internal topics* across multiple deployments. This brings {*}immediate 
benefits{*}:
 * *Drastically Reduces Topic Proliferation* – Eliminates unnecessary topic 
creation.
 * *Faster Kafka Connect Cluster Deployment* – No waiting for new topic 
provisioning.
 ** *Large Enterprises with Multiple Teams Using Kafka Connect*
 * *Scenario:* In large organisations, multiple teams manage different *Kafka 
Connect clusters* for various data pipelines.
 * *Benefit:* Instead of waiting for new *internal topics* to be provisioned 
each time a new cluster is deployed, teams can *immediately start* using 
pre-existing shared topics, reducing lead time and improving efficiency.


 ** *Cloud-Native & Kubernetes-Based Deployments*
 * *Scenario:* Many organisations deploy Kafka Connect in *containerised 
environments* (e.g., Kubernetes), where clusters are frequently *scaled 
up/down* or *recreated* dynamically.
 * *Benefit:* Since internal topics are already available, new clusters can 
{*}spin up instantly{*}, without waiting for *topic provisioning* or {*}Kafka 
ACL approvals{*}.
 * How this will help different organisations:


 * *Lower Operational Load* – Reduces disk-intensive cleanup operations.
 ** Broker resource utilization is expected to decrease by approximately 20%, 
primarily due to reduced partition count and metadata overhead. This 
optimization can enable further cluster downscaling, contributing directly to 
lower infrastructure costs (e.g., fewer brokers, reduced EBS storage footprint, 
and lower I/O throughput).
 ** Administrative overhead and monitoring complexity are projected to reduce 
by 30%, due to:
 * Fewer topics to configure, monitor, and apply retention/compaction policies 
to.
 * Reduced rebalancing operations during cluster scale-in or scale-out events.
 * Minimized data movement and replication workload in failure scenarios (e.g., 
EBS volume replacement), as fewer partitions are involved.
 * *Considering the current load* 
 * *Simplified Management* – Less overhead in monitoring and maintaining 
internal topics.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to