Vladislav Pyatkov created IGNITE-11425:
------------------------------------------

             Summary: Log information about inaccessible nodes through 
Communication
                 Key: IGNITE-11425
                 URL: https://issues.apache.org/jira/browse/IGNITE-11425
             Project: Ignite
          Issue Type: Improvement
            Reporter: Vladislav Pyatkov


In case of long getting communication TCP client (longe than this 
CONNECTION_ESTABLISH_THRESHOLD_MS = 100) message will printed:

{noformat}
[sys-#20167%dht.CacheGetReadFromBackupFailoverTest0%][TcpCommunicationSpi] TCP 
client created [client=GridTcpNioCommunicationClient 
[ses=GridSelectorNioSessionImpl [worker=DirectNioClientWorker 
[super=AbstractNioClientWorker [idx=3, bytesRcvd=0, bytesSent=0, bytesRcvd0=0, 
bytesSent0=0, select=true, super=GridWorker [name=grid-nio-worker-tcp-comm-3, 
igniteInstanceName=dht.CacheGetReadFromBackupFailoverTest0, finished=false, 
heartbeatTs=1550512236151, hashCode=140561231, interrupted=false, 
runner=grid-nio-worker-tcp-comm-3-#20147%dht.CacheGetReadFromBackupFailoverTest0%]]],
 writeBuf=java.nio.DirectByteBuffer[pos=0 lim=32768 cap=32768], 
readBuf=java.nio.DirectByteBuffer[pos=0 lim=32768 cap=32768], 
inRecovery=GridNioRecoveryDescriptor [acked=0, resendCnt=0, rcvCnt=0, 
sentCnt=0, reserved=true, lastAck=0, nodeLeft=false, node=TcpDiscoveryNode 
[id=8a660330-6ddb-4031-b955-4cb4f4b00002, addrs=ArrayList [127.0.0.1], 
sockAddrs=HashSet [/127.0.0.1:47502], discPort=47502, order=5, intOrder=4, 
lastExchangeTime=1550512235890, loc=false, ver=2.8.0#20190218-sha1:29232e37, 
isClient=false], connected=false, connectCnt=2, queueLimit=4096, reserveCnt=2, 
pairedConnections=false], outRecovery=GridNioRecoveryDescriptor [acked=0, 
resendCnt=0, rcvCnt=0, sentCnt=0, reserved=true, lastAck=0, nodeLeft=false, 
node=TcpDiscoveryNode [id=8a660330-6ddb-4031-b955-4cb4f4b00002, addrs=ArrayList 
[127.0.0.1], sockAddrs=HashSet [/127.0.0.1:47502], discPort=47502, order=5, 
intOrder=4, lastExchangeTime=1550512235890, loc=false, 
ver=2.8.0#20190218-sha1:29232e37, isClient=false], connected=false, 
connectCnt=2, queueLimit=4096, reserveCnt=2, pairedConnections=false], 
super=GridNioSessionImpl [locAddr=/127.0.0.1:38770, rmtAddr=/127.0.0.1:45212, 
createTime=1550512236151, closeTime=0, bytesSent=0, bytesRcvd=0, bytesSent0=0, 
bytesRcvd0=0, sndSchedTime=1550512236151, lastSndTime=1550512236151, 
lastRcvTime=1550512236151, readsPaused=false, 
filterChain=FilterChain[filters=[GridNioCodecFilter 
[parser=org.apache.ignite.internal.util.nio.GridDirectParser@d240a48, 
directMode=true], GridConnectionBytesVerifyFilter], accepted=false, 
markedForClose=false]], super=GridAbstractCommunicationClient 
[lastUsed=1550512236151, closed=false, connIdx=0]], duration=211ms]
{noformt}

but in some cases we can not to get client during time out, and the message 
reduce to
 
TCP client created [client=null, duration=60004 ms]

According to the message you cannot understand which nodes were inaccessible.
Moreover, wants to see the connection trouble earlier than the 10 minutes after.

Should to log ip/host for clear understanding what was the node and log WARN 
message each time when need to increase timeout:
{code}
if (lastWaitingTimeout < 60000)
  lastWaitingTimeout *= 2;
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to