anujmodi2021 commented on PR #6069: URL: https://github.com/apache/hadoop/pull/6069#issuecomment-1739110837
@steveloughran We found a way to avoid doing explicit array copies in MD5 computation in case of non-zero offsets. MessageDigest class of java allows us to do so using update and digest mechanism. MessageDigest.update() has a version where we can specify buffer, offset and length and after setting this, if we call digest function, it will compute MD5Hash of the data updated. This also points to the fact that same object of MessageDigest class cannot be shared among different appends. Still, we think it's better to have MD5 computation in parallel in client.append() only instead of doing it sequentially in ABFSOutputStream while creating Datablocks. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org