[ 
https://issues.apache.org/jira/browse/IGNITE-24945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Petrov reassigned IGNITE-24945:
---------------------------------------

    Assignee: Mikhail Petrov

> Node failure with java.nio.channels.CancelledKeyException on session close 
> with SSL enabled.
> --------------------------------------------------------------------------------------------
>
>                 Key: IGNITE-24945
>                 URL: https://issues.apache.org/jira/browse/IGNITE-24945
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Mikhail Petrov
>            Assignee: Mikhail Petrov
>            Priority: Critical
>              Labels: ise
>             Fix For: 2.18
>
>
> Exception: 
> {code:java}
> 2025-02-18 13:46:36.468 [ERROR][tcp-comm-worker-#1-#138][] Critical system 
> error detected. Will be handled accordingly to configured handler 
> [hnd=StopNodeOrHaltFailureHandler [tryStop=false, timeout=0, 
> super=AbstractFailureHandler [ignoredFailureTypes=UnmodifiableSet 
> [SYSTEM_WORKER_BLOCKED, SYSTEM_CRITICAL_OPERATION_TIMEOUT]]], 
> failureCtx=FailureContext [type=SYSTEM_WORKER_TERMINATION, 
> err=java.nio.channels.CancelledKeyException]]
> java.nio.channels.CancelledKeyException: null
>       at 
> java.base/sun.nio.ch.SelectionKeyImpl.ensureValid(SelectionKeyImpl.java:71) 
> ~[?:?]
>       at 
> java.base/sun.nio.ch.SelectionKeyImpl.interestOps(SelectionKeyImpl.java:90) 
> ~[?:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioServer$AbstractNioClientWorker.registerWrite(GridNioServer.java:2352)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioServer$HeadFilter.onSessionWrite(GridNioServer.java:3738)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioFilterAdapter.proceedSessionWrite(GridNioFilterAdapter.java:121)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.ssl.GridNioSslHandler.writeNetBuffer(GridNioSslHandler.java:502)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.ssl.GridNioSslFilter.shutdownSession(GridNioSslFilter.java:454)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.ssl.GridNioSslFilter.onSessionClose(GridNioSslFilter.java:434)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioFilterAdapter.proceedSessionClose(GridNioFilterAdapter.java:128)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridConnectionBytesVerifyFilter.onSessionClose(GridConnectionBytesVerifyFilter.java:138)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioFilterAdapter.proceedSessionClose(GridNioFilterAdapter.java:128)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioCodecFilter.onSessionClose(GridNioCodecFilter.java:137)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioFilterAdapter.proceedSessionClose(GridNioFilterAdapter.java:128)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioFilterChain$TailFilter.onSessionClose(GridNioFilterChain.java:274)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioFilterChain.onSessionClose(GridNioFilterChain.java:203)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridNioSessionImpl.close(GridNioSessionImpl.java:169)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridSelectorNioSessionImpl.close(GridSelectorNioSessionImpl.java:498)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.nio.GridTcpNioCommunicationClient.close(GridTcpNioCommunicationClient.java:81)
>  ~[classes/:?]
>       at 
> org.apache.ignite.spi.communication.tcp.internal.CommunicationWorker.processIdle(CommunicationWorker.java:278)
>  ~[classes/:?]
>       at 
> org.apache.ignite.spi.communication.tcp.internal.CommunicationWorker.body(CommunicationWorker.java:176)
>  ~[classes/:?]
>       at 
> org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:125) 
> ~[classes/:?]
>       at 
> org.apache.ignite.spi.communication.tcp.TcpCommunicationSpi$3.body(TcpCommunicationSpi.java:848)
>  ~[classes/:?]
>       at org.apache.ignite.spi.IgniteSpiThread.run(IgniteSpiThread.java:58) 
> ~[classes/:?]
> {code}
> Reason: 
> Multiple threads can access one SelectionKey. In this particular case it is - 
> tcp-comm-worker and grid-nio-worker.  
> 1. tcp-comm-worker - checks that key is not closed by invoking 
> SelectionKey#isValid()
> 2. grid-nio-worker - closes the SelectionKey
> 3. tcp-comm-worker - invokes SelectionKey#interestOps which raises 
> CancelledKeyException, which in its turn leads to the node failure.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to