kafka-reassign-partitions.sh leaves data on source broker?

2014-05-31 Thread Otis Gospodnetic
Hi, I have 2 pretty unbalanced brokers running 0.8.1.1 and I'm trying to balance them (read: free disk space on the overloaded broker). Is there a tool that will just rebalance everything? I couldn't find one, so I ran this: $ sh kafka-reassign-partitions.sh --broker-list 1 --zookeeper zk1.prod

Num of partitions != num.partitions in config

2014-05-31 Thread Otis Gospodnetic
Hi, I have Kafka 0.8.1.1 running and configured to have 24 partitions: $ grep partition /usr/lib/kafka_2.8.0-0.8.1.1/config/server.properties # The default number of log partitions per topic. More partitions allow greater num.partitions=24 But some of the topics don't seem to have 24 partitions,

Re: Topic Reassignment Tool Improvements

2014-05-31 Thread Otis Gospodnetic
+1 for sticking this on the Wiki or in JIRA without your internal stuff and letting those who need it (like us) clean up for inclusion. We've got a need for this, too. :) Thanks, Otis -- Performance Monitoring * Log Analytics * Search Analytics Solr & Elasticsearch Support * http://sematext.com/

Re: kafka-reassign-partitions.sh leaves data on source broker?

2014-05-31 Thread Guozhang Wang
Hello Otis, What is your replication factor? Guozhang On Sat, May 31, 2014 at 12:39 PM, Otis Gospodnetic < otis.gospodne...@gmail.com> wrote: > Hi, > > I have 2 pretty unbalanced brokers running 0.8.1.1 and I'm trying to > balance them (read: free disk space on the overloaded broker). > > Is t

Re: Num of partitions != num.partitions in config

2014-05-31 Thread Guozhang Wang
Hi Otis, The partition segment files are only created when the first message of this partition as arrived, so 1) if your producer send keyed messages which specifies their partitions or 2) your producer has not sent enough data (in 0.8.x it stick to one partition for a while before going to the ne

Re: kafka-reassign-partitions.sh leaves data on source broker?

2014-05-31 Thread Otis Gospodnetic
Hi Guozhang, I don't have any replication-related properties in my config file and I don't see any mentions of replication at startup in the log, so I don't really know. The only time I saw replication mentioned in the log was during some errors that said my replication factor was 1 and that was

Re: kafka-reassign-partitions.sh leaves data on source broker?

2014-05-31 Thread Guozhang Wang
You may take a look at the broker configs for replication factor: http://kafka.apache.org/documentation.html#brokerconfigs With replication factor 1 (default value) the broker holding the old leader should received StopReplica Request to stop the replica and delete the segment files. Do you see t

Re: kafka-reassign-partitions.sh leaves data on source broker?

2014-05-31 Thread Otis Gospodnetic
Hi, No, I don't think I saw anything with "Stop" in the logs, but I may have lost those old logs, so I'm not 100% sure. I just re-ran the same command and I didn't see anything with "Stop" in broker 2's (source) log. I did see log messages like the following one in broker 2's logs as soon as I r