[ https://issues.apache.org/jira/browse/HDFS-14659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chen Zhang resolved HDFS-14659. ------------------------------- Resolution: Abandoned duplicate jira due to apache jira server error > Refine NameSystem lock usage during processing FBR > -------------------------------------------------- > > Key: HDFS-14659 > URL: https://issues.apache.org/jira/browse/HDFS-14659 > Project: Hadoop HDFS > Issue Type: Improvement > Reporter: Chen Zhang > Priority: Major > > The disk with 12TB capacity is very normal today, which means the FBR size is > much larger than before, BlockManager holds the NameSystemLock during > processing block report for each storage, which might take quite a long time. > On our production environment, processing large FBR usually cause a longer > RPC queue time, which impacts client latency, so we did some simple work on > refining the lock usage, which improved the p99 latency significantly. > In our solution, BlockManager release the NameSystem write lock and request > it again for every 5000 blocks(by default) during processing FBR, with the > fair lock, all the RPC request can be processed before BlockManager > re-acquire the write lock. -- This message was sent by Atlassian JIRA (v7.6.14#76016) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org