liuguanghua created HDFS-17285:
----------------------------------

             Summary: [RBF] Decrease dfsrouter safe mode check period.
                 Key: HDFS-17285
                 URL: https://issues.apache.org/jira/browse/HDFS-17285
             Project: Hadoop HDFS
          Issue Type: Improvement
            Reporter: liuguanghua


When the dfsrouter starts, it enters safe mode, and it takes about 1 minute to leave.

The log is below:

14:35:23,717 INFO org.apache.hadoop.hdfs.server.federation.router.RouterSafemodeService: Leave startup safe mode after 30000 ms
14:35:23,717 INFO org.apache.hadoop.hdfs.server.federation.router.RouterSafemodeService: Enter safe mode after 180000 ms without reaching the State Store
14:35:23,717 INFO org.apache.hadoop.hdfs.server.federation.router.RouterSafemodeService: Entering safe mode
14:35:24,996 INFO org.apache.hadoop.hdfs.server.federation.router.RouterSafemodeService: Delaying safemode exit for 28721 milliseconds...
14:36:25,037 INFO org.apache.hadoop.hdfs.server.federation.router.RouterSafemodeService: Leaving safe mode after 61319 milliseconds

The time to leave safe mode depends on these configs:
DFS_ROUTER_SAFEMODE_EXTENSION 30s
DFS_ROUTER_SAFEMODE_EXPIRATION 3min
DFS_ROUTER_CACHE_TIME_TO_LIVE_MS 1min  (also used as the period of the safe mode check)

Because the dfsrouter rejects write requests while in safe mode, the safe mode 
check period should be shorter, so that the router can leave safe mode soon after 
refreshCaches is done. We should also separate 
DFS_ROUTER_CACHE_TIME_TO_LIVE_MS from RouterSafemodeService.
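
The timing in the log above can be reproduced with a rough sketch. It is only a simplified model, not the RouterSafemodeService code: it assumes the router leaves safe mode at the first periodic check that runs after the startup extension has elapsed, and that the State Store cache is already fresh by then. The function name safe_mode_exit_ms and the 1 s check period are hypothetical, chosen for illustration. The 1279 ms first-check offset is derived from the log (30000 ms minus the 28721 ms reported as remaining).

```python
def safe_mode_exit_ms(first_check_ms, check_period_ms, extension_ms):
    """Time (ms since startup) of the first periodic check that runs
    at or after the startup extension has elapsed.

    Simplified model: State Store freshness is assumed, so only the
    extension and the check period matter.
    """
    t = first_check_ms
    while t < extension_ms:
        # This check only delays the exit; the next one runs a full period later.
        t += check_period_ms
    return t


# Values taken from the log: first check about 1279 ms after startup,
# 30 s extension, 1 min check period (DFS_ROUTER_CACHE_TIME_TO_LIVE_MS).
print(safe_mode_exit_ms(1279, 60_000, 30_000))  # 61279

# Hypothetical 1 s check period, decoupled from the cache TTL.
print(safe_mode_exit_ms(1279, 1_000, 30_000))   # 30279
```

Under this model the router stays in safe mode for roughly 61 s with the current settings, close to the 61319 ms in the log, while a shorter, separate check period would let it leave shortly after the 30 s extension.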


