Jinglun created HDFS-15605: ------------------------------ Summary: DeadNodeDetector supports getting deadnode from NameNode. Key: HDFS-15605 URL: https://issues.apache.org/jira/browse/HDFS-15605 Project: Hadoop HDFS Issue Type: Improvement Reporter: Jinglun
When we are using DeadNodeDetector, sometimes it marks too many nodes as dead and cause the read failures. The DeadNodeDetector assumes all the getDatanodeInfo rpcs failed to return in time are dead nodes. But actually not. A client side error or a slow rpc in DataNode might be marked as dead too. For example the client side delay of the rpcThreadPool might cause the getDatanodeInfo rpcs timeout and adding many datanodes to the dead list. We have a simple improvement for this: the NameNode already knows which datanodes are dead. So just update the dead list from NameNode using DFSClient.datanodeReport(). -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org