Zehao Chen created HDFS-15811: --------------------------------- Summary: completeFile should log final file size Key: HDFS-15811 URL: https://issues.apache.org/jira/browse/HDFS-15811 Project: Hadoop HDFS Issue Type: Improvement Reporter: Zehao Chen Assignee: Zehao Chen
Jobs, particularly hive queries by non-headless users, can create an excessive number of files (many hundreds of thousands). A single user's query can generate a sustained burst of 60-80% of all creates for tens of minutes or more and impact overall cluster performance. Adding the file size to the logline allows us to identify excessive tiny or large files. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org