[ 
https://issues.apache.org/jira/browse/KAFKA-5256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16012717#comment-16012717
 ] 

Guozhang Wang commented on KAFKA-5256:
--------------------------------------

[~twbecker] Thanks for reporting this. Could you elaborate on the correctness 
impact this may have? What do you mean by "records that have been deleted from 
the backing topic": were they deleted because of log retention policies or 
because of log compaction? Since changelog messages are always "overrides", 
replaying them should be idempotent, shouldn't it?
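
To make the question concrete, here is a minimal sketch of one possible 
reading of the report. It does not use the Kafka Streams API: the store is a 
plain map, the changelog is a plain list, and the names ChangelogReplaySketch, 
Entry and replay are hypothetical. It assumes that a null value acts as a 
delete marker and that the record for key "b" is no longer present in the 
topic at restore time; whether that matches the reporter's case is exactly 
what is being asked.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class ChangelogReplaySketch {

    /** Hypothetical stand-in for a changelog record; a null value means "delete". */
    public static final class Entry {
        final String key;
        final String value;

        public Entry(String key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /** Applies each record in order: a non-null value overrides, a null value removes. */
    public static Map<String, String> replay(Map<String, String> store, List<Entry> changelog) {
        for (Entry e : changelog) {
            if (e.value == null) {
                store.remove(e.key);
            } else {
                store.put(e.key, e.value);
            }
        }
        return store;
    }

    public static void main(String[] args) {
        // Assumption: the topic now holds only the latest record for "a";
        // nothing for "b" remains in it.
        List<Entry> changelog = List.of(new Entry("a", "3"));

        // Local store left over from an earlier run, with no checkpoint.
        Map<String, String> reused = new TreeMap<>();
        reused.put("a", "1");
        reused.put("b", "2");

        // Replaying is idempotent: a second pass changes nothing.
        replay(reused, changelog);
        replay(reused, changelog);

        // Same replay into a store that was deleted and recreated first.
        Map<String, String> recreated = replay(new TreeMap<>(), changelog);

        System.out.println("reused store:    " + reused);    // {a=3, b=2}
        System.out.println("recreated store: " + recreated); // {a=3}
    }
}
```

In this sketch the replay is idempotent, yet the reused store still contains 
"b" because no record in the topic ever overrides or removes it. The 
recreated store matches the topic contents.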

> Non-checkpointed state stores should be deleted before restore
> --------------------------------------------------------------
>
>                 Key: KAFKA-5256
>                 URL: https://issues.apache.org/jira/browse/KAFKA-5256
>             Project: Kafka
>          Issue Type: Bug
>          Components: streams
>    Affects Versions: 0.10.2.1
>            Reporter: Tommy Becker
>
> Currently, Kafka Streams will re-use an existing state store even if there is 
> no checkpoint for it. This seems both inefficient (because duplicate inserts 
> can be made on restore) and incorrect (records which have been deleted from 
> the backing topic may still exist in the store). Since the contents of a 
> store with no checkpoint are unknown, the best way to proceed would be to 
> delete the store and recreate before restoring.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
