Jiandan Yang  created HDFS-17805:
------------------------------------

             Summary: A more detailed flushOrSync Warn Log
                 Key: HDFS-17805
                 URL: https://issues.apache.org/jira/browse/HDFS-17805
             Project: Hadoop HDFS
          Issue Type: Improvement
          Components: datanode
            Reporter: Jiandan Yang 


When DataNode receives a request to flush or sync a block being written by a 
client , the method flushOrSync (boolean isSync) will be invoked.

In this method, the duration of the whole process will  be recorded, also the 
flush and sync time of streams will be summed and then marked as 
‘flushTotalNanos’.

If the duration of this ‘flush or sync’ process exceeds a certain threshold, a 
warn log would appear and tells the value of the duration and the 
‘flushTotalNanos’, enabling us to pinpoint the slower part of the execution 
within the flushOrSync(boolean isSync) methods.

The current log records the time for both flush and sync operations as a single 
metric called flushTotalNanos. However, if flushTotalNanos takes too long, 
users will not be able to further diagnose the issue in detail based on the log.

I propose that the log separately display the durations of flush and sync 
operations, instead of combining them into flushTotalNanos. This would allow 
users to pinpoint issues more accurately.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org

Reply via email to