Hi,
Sorry for the confusion in my last mail. Actually, in my case Kylin can reconnect to
ZooKeeper, but something still seems wrong: Kylin keeps looping through session
timeout and reconnect.
The sequence seems to be:
1. Kylin reconnected to ZooKeeper at 19:00:59.
2. The session timed out because Kylin had not heard from ZooKeeper, so Kylin
closed the connection.
3. ZooKeeper threw EndOfStreamException because Kylin had closed the socket.
Does anyone know why this happens? Any suggestions?
Here is the Kylin log:
2016-08-18 19:00:15,839 INFO [Thread-10-EventThread]
state.ConnectionStateManager:228 : State change: RECONNECTED
2016-08-18 19:00:59,527 INFO [Thread-10-SendThread(bigdata-3:2181)]
zookeeper.ClientCnxn:1096 : Client session timed out, have not heard from
server in 43681ms for sessionid 0x1569cf2765a0012, closing socket connection
and attempting reconnect
2016-08-18 19:00:59,628 INFO [Thread-10-EventThread]
state.ConnectionStateManager:228 : State change: SUSPENDED
2016-08-18 19:00:59,717 INFO [Thread-10-SendThread(bigdata-2:2181)]
zookeeper.ClientCnxn:975 : Opening socket connection to server
bigdata-2/192.168.0.25:2181. Will not attempt to authenticate using SASL
(unknown error)
2016-08-18 19:00:59,722 INFO [Thread-10-SendThread(bigdata-2:2181)]
zookeeper.ClientCnxn:852 : Socket connection established to
bigdata-2/192.168.0.25:2181, initiating session
2016-08-18 19:00:59,740 INFO [Thread-10-SendThread(bigdata-2:2181)]
zookeeper.ClientCnxn:1235 : Session establishment complete on server
bigdata-2/192.168.0.25:2181, sessionid = 0x1569cf2765a0012, negotiated timeout
= 60000
2016-08-18 19:00:59,741 INFO [Thread-10-EventThread]
state.ConnectionStateManager:228 : State change: RECONNECTED
2016-08-18 19:00:59,923 INFO [pool-8-thread-1] threadpool.DefaultScheduler:109
: Job Fetcher: 1 running, 1 actual running, 0 ready, 104 others
2016-08-18 19:01:39,118 INFO [pool-8-thread-1] threadpool.DefaultScheduler:109
: Job Fetcher: 1 running, 1 actual running, 0 ready, 104 others
2016-08-18 19:02:25,557 INFO [Thread-10-SendThread(bigdata-2:2181)]
zookeeper.ClientCnxn:1096 : Client session timed out, have not heard from
server in 46832ms for sessionid 0x1569cf2765a0012, closing socket connection
and attempting reconnect
2016-08-18 19:02:25,659 INFO [Thread-10-EventThread]
state.ConnectionStateManager:228 : State change: SUSPENDED
2016-08-18 19:02:25,955 INFO [pool-8-thread-1] threadpool.DefaultScheduler:109
: Job Fetcher: 1 running, 1 actual running, 0 ready, 104 others
2016-08-18 19:02:28,357 INFO [Thread-10-SendThread(bigdata-4:2181)]
zookeeper.ClientCnxn:975 : Opening socket connection to server
bigdata-4/192.168.0.7:2181. Will not attempt to authenticate using SASL
(unknown error)
2016-08-18 19:02:28,361 INFO [Thread-10-SendThread(bigdata-4:2181)]
zookeeper.ClientCnxn:852 : Socket connection established to
bigdata-4/192.168.0.7:2181, initiating session
2016-08-18 19:02:28,371 INFO [Thread-10-SendThread(bigdata-4:2181)]
zookeeper.ClientCnxn:1235 : Session establishment complete on server
bigdata-4/192.168.0.7:2181, sessionid = 0x1569cf2765a0012, negotiated timeout =
60000
2016-08-18 19:02:28,371 INFO [Thread-10-EventThread]
state.ConnectionStateManager:228 : State change: RECONNECTED
And here is the log from ZooKeeper:
2016-08-18 19:00:59,726 [myid:2] - INFO
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:NIOServerCnxnFactory@197] - Accepted
socket connection from /192.168.0.10:54427
2016-08-18 19:00:59,728 [myid:2] - INFO
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:ZooKeeperServer@861] - Client
attempting to renew session 0x1569cf2765a0012 at /192.168.0.10:54427
2016-08-18 19:00:59,729 [myid:2] - INFO
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:Learner@108] - Revalidating client:
0x1569cf2765a0012
2016-08-18 19:00:59,741 [myid:2] - INFO
[QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:2181:ZooKeeperServer@617] - Established
session 0x1569cf2765a0012 with negotiated timeout 60000 for client
/192.168.0.10:54427
2016-08-18 19:02:25,564 [myid:2] - WARN
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:NIOServerCnxn@357] - caught end of
stream exception
EndOfStreamException: Unable to read additional data from client sessionid
0x1569cf2765a0012, likely client has closed socket
at
org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:228)
at
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:208)
at java.lang.Thread.run(Thread.java:745)
2016-08-18 19:02:25,567 [myid:2] - INFO
[NIOServerCxn.Factory:0.0.0.0/0.0.0.0:2181:NIOServerCnxn@1007] - Closed socket
connection for client /192.168.0.10:54427 which had sessionid 0x1569cf2765a0012
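
To make the timing in the Kylin log easier to read, here is a minimal Python sketch that computes how long each connection lasted between a RECONNECTED state change and the next "Client session timed out" message. The timestamps and the "have not heard from server" values are copied from the log above. The event labels and function names are only for illustration. The 2/3 factor is an assumption about how the ZooKeeper Java client derives its read timeout from the negotiated session timeout (60000 ms here), so please treat the threshold as approximate.

```python
from datetime import datetime, timedelta

# Events copied from the Kylin log above; the labels are illustrative only.
EVENTS = [
    ("2016-08-18 19:00:15,839", "RECONNECTED"),
    ("2016-08-18 19:00:59,527", "TIMED_OUT"),
    ("2016-08-18 19:00:59,741", "RECONNECTED"),
    ("2016-08-18 19:02:25,557", "TIMED_OUT"),
    ("2016-08-18 19:02:28,371", "RECONNECTED"),
]

# "have not heard from server in ...ms" values from the two timeout messages.
REPORTED_IDLE_MS = [43681, 46832]

NEGOTIATED_TIMEOUT_MS = 60000
# Assumption: the client gives up on a connection after 2/3 of the
# negotiated session timeout without hearing from the server.
READ_TIMEOUT_MS = NEGOTIATED_TIMEOUT_MS * 2 // 3


def parse(ts):
    """Parse a log4j-style timestamp such as '2016-08-18 19:00:15,839'."""
    return datetime.strptime(ts, "%Y-%m-%d %H:%M:%S,%f")


def connected_spans_ms(events):
    """Milliseconds between each RECONNECTED and the following TIMED_OUT."""
    spans = []
    start = None
    for ts, kind in events:
        if kind == "RECONNECTED":
            start = parse(ts)
        elif kind == "TIMED_OUT" and start is not None:
            spans.append((parse(ts) - start) // timedelta(milliseconds=1))
            start = None
    return spans


spans = connected_spans_ms(EVENTS)
for span, idle in zip(spans, REPORTED_IDLE_MS):
    print(f"connected for {span} ms, silent for the last {idle} ms, "
          f"assumed read timeout {READ_TIMEOUT_MS} ms")
```

Comparing each connected span with the reported silent period shows how much of each connection passed without the client hearing anything from the server.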