Karthik Reddy created KAFKA-3391: ------------------------------------ Summary: Kafka to ZK timeout Key: KAFKA-3391 URL: https://issues.apache.org/jira/browse/KAFKA-3391 Project: Kafka Issue Type: Bug Components: consumer, zkclient Affects Versions: 0.8.2.0 Environment: RHEL 7.2, AWS EC2 compute instance Reporter: Karthik Reddy Assignee: Neha Narkhede Priority: Critical
Hi Team, We have seen the below messages in the Kafka logs, indicating there was a timeout on ZK. Could you please advise us on how to tune or better optimize the Kafka-ZK communication. [2016-03-10 02:29:25,858] INFO Unable to read additional data from server sessionid 0x5531d0003f30030, likely server has closed socket, closing socket connection and attempting reconnect (org.apache.zookeeper.ClientCnxn) [2016-03-10 02:29:25,958] INFO zookeeper state changed (Disconnected) (org.I0Itec.zkclient.ZkClient) [2016-03-10 02:29:26,381] INFO Opening socket connection to server 10.200.77.74/10.200.77.74:8164. Will not attempt to authenticate using SASL (unknown error) (org.apache.zookeeper.ClientCnxn) [2016-03-10 02:29:26,382] INFO Socket connection established to 10.200.77.74/10.200.77.74:8164, initiating session (org.apache.zookeeper.ClientCnxn) [2016-03-10 02:29:26,385] INFO Session establishment complete on server 10.200.77.74/10.200.77.74:8164, sessionid = 0x5531d0003f30030, negotiated timeout = 6000 (org.apache.zookeeper.ClientCnxn) [2016-03-10 02:29:26,385] INFO zookeeper state changed (SyncConnected) (org.I0Itec.zkclient.ZkClient) [2016-03-10 02:29:30,961] INFO conflict in /controller data: {"version":1,"brokerid":3,"timestamp":"1457594970952"} stored data: {"version":1,"brokerid":5,"timestamp":"1457594970043"} (kafka.utils.ZkUtils$) [2016-03-10 02:29:30,969] INFO New leader is 5 (kafka.server.ZookeeperLeaderElector$LeaderChangeListener) [2016-03-10 02:29:31,620] INFO [ReplicaFetcherManager on broker 3] Removed fetcher for partitions [__consumer_offsets,0],[fulfillment.payments.autopay.mongooperation.response,1],[__consumer_offsets,20],[__consumer_offsets,40] (kafka.server.ReplicaFetcherManager) [2016-03-10 02:29:31,621] INFO [ReplicaFetcherManager on broker 3] Removed fetcher for partitions [efit.framework.notification.error,1],[__consumer_offsets,15],[fulfillment.payments.autopay.processexception.notification,1],[__consumer_offsets,35] (kafka.server.ReplicaFetcherManager) [2016-03-10 02:29:31,621] INFO Truncating log efit.framework.notification.error-1 to offset 637. (kafka.log.Log) [2016-03-10 02:29:31,621] INFO Truncating log __consumer_offsets-15 to offset 0. (kafka.log.Log) [2016-03-10 02:29:31,622] INFO Truncating log fulfillment.payments.autopay.processexception.notification-1 to offset 0. (kafka.log.Log) [2016-03-10 02:29:31,622] INFO Truncating log __consumer_offsets-35 to offset 0. (kafka.log.Log) [2016-03-10 02:29:31,623] INFO Loading offsets from [__consumer_offsets,0] (kafka.server.OffsetManager) [2016-03-10 02:29:31,624] INFO Loading offsets from [__consumer_offsets,20] (kafka.server.OffsetManager) [2016-03-10 02:29:31,624] INFO Finished loading offsets from [__consumer_offsets,0] in 1 milliseconds. (kafka.server.OffsetManager) [2016-03-10 02:29:31,625] INFO Loading offsets from [__consumer_offsets,40] (kafka.server.OffsetManager) [2016-03-10 02:29:31,625] INFO Finished loading offsets from [__consumer_offsets,20] in 1 milliseconds. (kafka.server.OffsetManager) [2016-03-10 02:29:31,625] INFO Finished loading offsets from [__consumer_offsets,40] in 0 milliseconds. (kafka.server.OffsetManager) [2016-03-10 02:29:31,627] INFO [ReplicaFetcherManager on broker 3] Added fetcher for partitions List([[efit.framework.notification.error,1], initOffset 637 to broker id:1,host:10.200.77.78,port:8165] , [[__consumer_offsets,15], initOffset 0 to broker id:1,host:10.200.77.78,port:8165] , [[fulfillment.payments.autopay.processexception.notification,1], initOffset 0 to broker id:5,host:10.200.75.150,port:8165] , [[__consumer_offsets,35], initOffset 0 to broker id:1,host:10.200.77.78,port:8165] ) (kafka.server.ReplicaFetcherManager) [2016-03-10 02:29:31,627] INFO [ReplicaFetcherThread-0-2], Shutting down (kafka.server.ReplicaFetcherThread Thanks, Karthik -- This message was sent by Atlassian JIRA (v6.3.4#6332)