Hi Emmanuel, Can you post your kafka server.properties and in your producer are your distributing your messages into all kafka topic partitions.
-- Harsha On March 20, 2015 at 12:33:02 PM, Emmanuel (ele...@msn.com) wrote: Kafka on test cluster: 2 Kafka nodes, 2GB, 2CPUs 3 Zookeeper nodes, 2GB, 2CPUs Storm: 3 nodes, 3CPUs each, on the same Zookeeper cluster as Kafka. 1 topic, 5 partitions, replication x2 Whether I use 1 slot for the Kafka Spout or 5 slots (=#partitions), the throughput seems about the same. I can't seem to read much more than 7000 events/sec. Same, on writing, I set a generator spout and write to Kafka on 1 topic/5partitions with a KafkaBolt with parallelism of 5 and I can't seem to write much more than 7000 events/sec. Meanwhile, none of the CPU, IO or MEM seem to be a bottleneck: In Storm UI the bolts all show capacities <50%, sometimes much less (in the single digit %) Top shows CPUs being used at ~30% max We have another process moving data from Kafka to Cassandra and it gives similar throughput, so it seems related to Kafka more than Storm. What could be wrong? Sorry for the generic question but I would appreciate any hint on where to start to troubleshoot. Thanks