Henry Cai created KAFKA-19225: --------------------------------- Summary: Tiered Storage Support for Active Log Segment Key: KAFKA-19225 URL: https://issues.apache.org/jira/browse/KAFKA-19225 Project: Kafka Issue Type: New Feature Components: Tiered-Storage Affects Versions: 4.0.0 Reporter: Henry Cai Assignee: Henry Cai Fix For: 4.0.1
This is the Jira for [KIP-1176: Tiered Storage Support for Active Log Segment|[https://cwiki.apache.org/confluence/display/KAFKA/KIP-1176%3A+Tiered+Storage+for+Active+Log+Segment]] In KIP-405, the community has proposed and implemented the tiered storage for old Kafka log segment files, when the log segments is older than {_}local.retention.ms{_}, it becomes eligible to be uploaded to cloud's object storage and removed from the local storage thus reducing local storage cost. KIP-405 only uploads older log segments but not the most recent active log segments (write-ahead logs). Thus in a typical 3-way replicated Kafka cluster, the 2 follower brokers would still need to replicate the active log segments from the leader broker. It is common practice to set up the 3 brokers in three different AZs to improve the high availability of the cluster. This would cause the replications between leader/follower brokers to be across AZs which is a significant cost ([various studies|https://www.confluent.io/blog/understanding-and-optimizing-your-kafka-costs-part-1-infrastructure/] show the across AZ transfer cost typically comprises 50%-60% of the total cluster cost). Since all the active log segments are physically present on three Kafka Brokers, they still comprise significant resource usage on the brokers. The state of the broker is still quite big during node replacement, leading to longer node replacement time. [KIP-1150|https://cwiki.apache.org/confluence/display/KAFKA/KIP-1150%3A+Diskless+Topics] recently proposes diskless Kafka topic, but leads to increased latency and a significant redesign. In comparison, this proposed KIP maintains identical performance for acks=1 producer path, minimizes design changes to Kafka, and still slashes cost by an estimated 43%. -- This message was sent by Atlassian Jira (v8.20.10#820010)