Shawn Wang created KAFKA-14016:
----------------------------------

             Summary: Revoke more partitions than expected in Cooperative 
rebalance
                 Key: KAFKA-14016
                 URL: https://issues.apache.org/jira/browse/KAFKA-14016
             Project: Kafka
          Issue Type: Bug
          Components: clients
    Affects Versions: 3.3.0
            Reporter: Shawn Wang


In https://issues.apache.org/jira/browse/KAFKA-13419 we found that some 
consumer didn't reset generation and state after sync group fail with 
REABALANCE_IN_PROGRESS error.

So we fixed it by reset generationId (no memberId) when  sync group fail with 
REABALANCE_IN_PROGRESS error.

But this change missed the reset part, so another change made in 
https://issues.apache.org/jira/browse/KAFKA-13891 make this works.

After apply this change, we found that: sometimes consumer will revoker almost 
1/3 of the partitions with cooperative enabled. Because if a consumer did a 
very quick re-join, other consumers will get REABALANCE_IN_PROGRESS in 
syncGroup and revoked their partition before re-jion.

 

I think the whole history is : in Cooperative rebalance, we found a duplicate 
assign bug https://issues.apache.org/jira/browse/KAFKA-12984 and have several 
fixes:
 # 
https://issues.apache.org/jira/browse/KAFKA-12983

 * 
 **  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to