Guozhang Wang created KAFKA-6038:
------------------------------------

             Summary: Repartition topics could be much more transient
                 Key: KAFKA-6038
                 URL: https://issues.apache.org/jira/browse/KAFKA-6038
             Project: Kafka
          Issue Type: Improvement
          Components: streams
            Reporter: Guozhang Wang


Unlike changelog topics, repartition topics can be short-lived rather than
taking up storage space on the Kafka brokers. Today users have different ways
to configure them with short retention, such as enforcing a retention of 30
minutes with small log segment sizes, or using AppendTime for repartition
topics. All of these are cumbersome, and Streams should just do this
automatically.
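
For illustration, the manual workaround described above might look roughly like
the following topic-level overrides. This is only a sketch: the config keys
(retention.ms, segment.bytes, message.timestamp.type) are standard Kafka topic
configs, but the helper names are hypothetical, the 50 MiB segment size is an
assumed value for "small", and the two approaches mentioned above are combined
here only for brevity.

```python
# Illustrative topic-level overrides for a repartition topic.
RETENTION_MINUTES = 30             # the 30-minute retention mentioned above
SEGMENT_BYTES = 50 * 1024 * 1024   # assumption: "small" taken as 50 MiB


def repartition_topic_overrides(retention_minutes: int = RETENTION_MINUTES,
                                segment_bytes: int = SEGMENT_BYTES) -> dict:
    """Build the overrides as string key/value pairs (hypothetical helper)."""
    return {
        # Time-based retention, expressed in milliseconds.
        "retention.ms": str(retention_minutes * 60 * 1000),
        # Smaller segments roll sooner, so old data becomes deletable sooner.
        "segment.bytes": str(segment_bytes),
        # Stamp records with broker append time rather than producer time.
        "message.timestamp.type": "LogAppendTime",
    }


def as_config_string(overrides: dict) -> str:
    """Render the overrides as a comma-separated key=value list."""
    return ",".join(f"{key}={value}" for key, value in sorted(overrides.items()))
```

Every repartition topic of every application would need overrides like these
applied by hand, which is the cumbersome part.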

One way to do this is to use the “purgeData” admin API (KIP-107): after the
offsets of the input topics are committed, if the input topics are actually
repartition topics, we would purge the data up to the committed offsets
immediately. One tricky thing to consider, though, is that upon (re-)starting
the application, if the repartition topics are used for restoring state, we
need to re-fill these topics in the right way for restoration to work, and
there may be some devils in the implementation details.
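
To make the purge-after-commit idea concrete, here is a toy in-memory model.
It is a sketch only: PartitionLog and commit_and_purge are hypothetical names,
nothing here talks to a broker or calls the KIP-107 API, and the re-fill
problem on restart is not modeled at all.

```python
from dataclasses import dataclass, field


@dataclass
class PartitionLog:
    """Toy stand-in for a single partition of a repartition topic."""
    records: list = field(default_factory=list)
    log_start_offset: int = 0  # offset of the oldest record still retained

    @property
    def log_end_offset(self) -> int:
        """Offset that the next appended record will receive."""
        return self.log_start_offset + len(self.records)

    def append(self, value) -> int:
        """Append a record and return the offset it was written at."""
        offset = self.log_end_offset
        self.records.append(value)
        return offset

    def purge_before(self, offset: int) -> int:
        """Drop every record below `offset`; return how many were dropped."""
        # Clamp so that a stale or too-large offset can never corrupt the log.
        target = max(self.log_start_offset, min(offset, self.log_end_offset))
        dropped = target - self.log_start_offset
        del self.records[:dropped]
        self.log_start_offset = target
        return dropped


def commit_and_purge(log: PartitionLog, committed_offset: int,
                     is_repartition: bool) -> int:
    """After a commit, purge consumed records only for repartition topics.

    A committed offset is the next offset to be read, so everything below it
    has already been processed.
    """
    if not is_repartition:
        return 0
    return log.purge_before(committed_offset)
```

In this model, committing offset 3 on a repartition partition drops offsets 0
through 2 and moves the log start to 3, while the same commit on an ordinary
source topic leaves the log untouched.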



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
