Ahmar Suhail created HADOOP-19364:
-------------------------------------

             Summary: Support IoStatistics for new analytics-accelerator stream
                 Key: HADOOP-19364
                 URL: https://issues.apache.org/jira/browse/HADOOP-19364
             Project: Hadoop Common
          Issue Type: Sub-task
          Components: fs/s3
            Reporter: Ahmar Suhail

S3A provides InputStream statistics:

[https://github.com/apache/hadoop/blob/trunk/hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/statistics/S3AInputStreamStatistics.java]

These help track things such as how many bytes were read from a stream. The analytics-accelerator stream integration does not yet implement statistics.

To start with, we should identify which of these statistics make sense for us to track in the new stream. Some examples are:

1/ bytesRead
2/ readOperationStarted
3/ initiateGetRequest

Some of these (1 and 2) are more straightforward and should not require any changes to analytics-accelerator-s3, but tracking GET requests (3) will require changes there.

We should also add tests that make assertions on these statistics. See ITestS3APrefetchingInputStream for an example of how to do this.

See https://issues.apache.org/jira/browse/HADOOP-18190 for how this was done on the prefetching stream, and the PR: https://github.com/apache/hadoop/pull/4458
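
As a rough illustration of why statistics 1 and 2 can be collected entirely on the S3A side, here is a minimal sketch in plain Java. It is not the Hadoop API: SketchStreamStatistics and CountingInputStream are hypothetical names invented for this example, and a real implementation would presumably update the existing S3AInputStreamStatistics interface instead. The sketch only assumes that the new stream can be wrapped or subclassed so that every read call passes through code we control.

```java
import java.io.ByteArrayInputStream;
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.concurrent.atomic.AtomicLong;

/**
 * Hypothetical counters. The method names echo the statistics listed in
 * this issue, but this class is NOT part of Hadoop.
 */
final class SketchStreamStatistics {
    private final AtomicLong bytesRead = new AtomicLong();
    private final AtomicLong readOperations = new AtomicLong();

    /** Record that a read call was started. */
    void readOperationStarted() {
        readOperations.incrementAndGet();
    }

    /** Record bytes returned to the caller; -1 (end of stream) is ignored. */
    void bytesRead(long count) {
        if (count > 0) {
            bytesRead.addAndGet(count);
        }
    }

    long getBytesRead() {
        return bytesRead.get();
    }

    long getReadOperations() {
        return readOperations.get();
    }
}

/**
 * Hypothetical wrapper that counts reads on any underlying stream.
 * Nothing here needs cooperation from the wrapped stream, which is why
 * bytesRead and readOperationStarted need no library changes.
 */
final class CountingInputStream extends FilterInputStream {
    private final SketchStreamStatistics stats;

    CountingInputStream(InputStream in, SketchStreamStatistics stats) {
        super(in);
        this.stats = stats;
    }

    @Override
    public int read() throws IOException {
        stats.readOperationStarted();
        int b = in.read();
        if (b >= 0) {
            stats.bytesRead(1);
        }
        return b;
    }

    @Override
    public int read(byte[] buf, int off, int len) throws IOException {
        // FilterInputStream.read(byte[]) delegates here, so it is counted once.
        stats.readOperationStarted();
        int n = in.read(buf, off, len);
        stats.bytesRead(n);
        return n;
    }
}
```

A test in the spirit of ITestS3APrefetchingInputStream would read a known number of bytes through the stream and then assert on the counters. Counting GET requests cannot be done this way, because the wrapper never sees when the underlying library issues a request; that is the part that needs a hook in analytics-accelerator-s3.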