[ 
https://issues.apache.org/jira/browse/KAFKA-10137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17168234#comment-17168234
 ] 

Sophie Blee-Goldman commented on KAFKA-10137:
---------------------------------------------

I think there's actually a real bug lurking here: I was looking at the 
ChangeLoggingWindowBytesStore and noticed we seem to insert the sequence number 
into the changelogged bytes regardless of `retainDuplicates`. 

We peel off the unnecessary seqnum during restoration, so it doesn't seem to 
cause any correctness issues. But we're obviously storing an extra 4 bytes per 
window store changelog record for no reason. Unfortunatel,y I'm not sure how 
this can be fixed in a backwards compatible way

> Clean-up retain Duplicate logic in Window Stores
> ------------------------------------------------
>
>                 Key: KAFKA-10137
>                 URL: https://issues.apache.org/jira/browse/KAFKA-10137
>             Project: Kafka
>          Issue Type: Task
>          Components: streams
>    Affects Versions: 2.5.0
>            Reporter: Bruno Cadonna
>            Priority: Minor
>
> Stream-stream joins use the regular `WindowStore` implementation but with 
> `retainDuplicates` set to true. To allow for duplicates while using the same 
> unique-key underlying stores we just wrap the key with an incrementing 
> sequence number before inserting it.
> The logic to maintain and append the sequence number is present in multiple 
> locations, namely in the changelogging window store and in its underlying 
> window stores. We should consolidate this code to one single location.  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to