Jay Kreps created KAFKA-3015: -------------------------------- Summary: Improve JBOD data balancing Key: KAFKA-3015 URL: https://issues.apache.org/jira/browse/KAFKA-3015 Project: Kafka Issue Type: Improvement Reporter: Jay Kreps
When running with multiple data directories (i.e. JBOD) we currently place partitions entirely within one data directory. This tends to lead to poor balancing across disks as some topics have more throughput/retention and not all disks get data from all topics. You can't fix this problem with smarter partition placement strategies because ultimately you don't know when a partition is created when or how heavily it will be used (this is a subtle point, and the tendency is to try to think of some more sophisticated way to place partitions based on current data size but this is actually exceptionally dangerous and can lead to much worse imbalance when creating many partitions at once as they would all go to the disk with the least data). We don't support online rebalancing across directories/disks so this imbalance is a big problem and limits the usefulness of this configuration. Implementing online rebalancing of data across disks without downtime is actually quite hard and requires lots of I/O since you have to actually rewrite full partitions of data. An alternative would be to place each partition in *all* directories/drives and round-robin *segments* within the partition across the directories. So the layout would be something like: drive-a/mytopic-0/ 0000000.data 0000000.index 0024680.data 0024680.index drive-a/mytopic-0/ 0012345.data 0012345.index 0036912.data 0036912.index This is a little harder to implement than the current approach but not very hard, and it is a lot easier than implementing online data balancing across disks while retaining the current approach. I think this could easily be done in a backwards compatible way. I think the balancing you would get from this in most cases would be good enough to make JBOD the default configuration. Thoughts? -- This message was sent by Atlassian JIRA (v6.3.4#6332)