Daryn Sharp created HADOOP-11780: ------------------------------------ Summary: Prevent IPC reader thread death Key: HADOOP-11780 URL: https://issues.apache.org/jira/browse/HADOOP-11780 Project: Hadoop Common Issue Type: Bug Components: ipc Affects Versions: 2.0.0-alpha Reporter: Daryn Sharp Priority: Critical
Reader threads can die to a race condition with the responder thread. If the server's ipc handler cannot send a response in one write, it delegates sending the rest of the response to the responder thread. The race occurs when the responder thread has an exception writing to the socket. The responder closes the socket. This wakes up the reader polling on the socket. If a {{CancelledKeyException}} is thrown, which is a runtime exception, the reader dies. All connections serviced by that reader are now in limbo until the client possibly times out. New connections play roulette as to whether they are assigned to a defunct reader. -- This message was sent by Atlassian JIRA (v6.3.4#6332)