anujmodi2021 commented on PR #6069:
URL: https://github.com/apache/hadoop/pull/6069#issuecomment-1739110837

   @steveloughran 
   We found a way to avoid doing explicit array copies in MD5 computation in 
case of non-zero offsets. MessageDigest class of java allows us to do so using 
update and digest mechanism. MessageDigest.update() has a version where we can 
specify buffer, offset and length and after setting this, if we call digest 
function, it will compute MD5Hash of the data updated.
   
   This also points to the fact that same object of MessageDigest class cannot 
be shared among different appends.
   
   Still, we think it's better to have MD5 computation in parallel in 
client.append() only instead of doing it sequentially in ABFSOutputStream while 
creating Datablocks.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to