Dong Lin created KAFKA-5335:
-------------------------------

             Summary: Controller should batch updatePartitionReassignmentData() 
operation
                 Key: KAFKA-5335
                 URL: https://issues.apache.org/jira/browse/KAFKA-5335
             Project: Kafka
          Issue Type: Bug
            Reporter: Dong Lin
            Assignee: Dong Lin


Currently controller will update partition reassignment data every time a 
partition in the reassignment is completed. It means that if user specifies a 
huge reassignment znode of size 1 MB to move 10K partitions, controller will 
need to write roughly 0.5 MB * 10000 = 5 GB data to zookeeper in order to 
complete this reassignment. This is because controller needs to write the 
remaining partitions to the znode every time a partition is completely moved.

This is problematic because such a huge reassignment may greatly slow down 
Kafka controller. Note that partition reassignment doesn't necessarily cause 
data movement between brokers because we may use it only to recorder the 
replica list of partitions to evenly distribute preferred leader.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to