Hi, I got this from the log:
[exec] 03:50:06.216 [ProducerSendThread-] ERROR k.producer.async.ProducerSendThread - Error in handling batch of 100 events
[exec] kafka.common.FailedToSendMessageException: Failed to send messages after 3 tries.
[exec]     at kafka.producer.async.DefaultEventHandler.handle(Unknown Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
[exec]     at kafka.producer.async.ProducerSendThread.tryToHandle(Unknown Source) [kafka_2.10-0.8.0.jar:0.8.0]
[exec]     at kafka.producer.async.ProducerSendThread$$anonfun$processEvents$3.apply(Unknown Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
[exec]     at kafka.producer.async.ProducerSendThread$$anonfun$processEvents$3.apply(Unknown Source) ~[kafka_2.10-0.8.0.jar:0.8.0]
[exec]     at scala.collection.immutable.Stream.foreach(Stream.scala:547) ~[scala-library-2.10.1.jar:na]
[exec]     at kafka.producer.async.ProducerSendThread.processEvents(Unknown Source) [kafka_2.10-0.8.0.jar:0.8.0]
[exec]     at kafka.producer.async.ProducerSendThread.run(Unknown Source) [kafka_2.10-0.8.0.jar:0.8.0]

My questions are:

1. What could cause this?
2. Who should deal with the recovery: the user or Kafka?

Let me know if more of the log is needed. (It hangs the integration test, so I have 3.5G of them ... from a Maven build.)

--
Best Regards,
Mingtao Zhang