Walter Su created HDFS-8339: ------------------------------- Summary: Erasure Coding: Badly treated when createBlockOutputStream failed in DataStreamer Key: HDFS-8339 URL: https://issues.apache.org/jira/browse/HDFS-8339 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Walter Su
h4. Issue 1: leading streamer calls abandonBlock(..) and get new blockGroup from NN, primary streamer should sync with non-leading streamer instead of throw exception( offer twice to stripedBlock blockingQueue). {noformat} 2015-05-07 18:58:05,335 INFO hdfs.DataStreamer (DataStreamer.java:nextBlockOutputStream(1386)) - Abandoning BP-172584615-9.96.1.34-1430996280714:blk_-9223372036854775792_1001 ... 2015-05-07 18:58:05,373 WARN hdfs.DataStreamer (DataStreamer.java:run(572)) - DataStreamer Exception java.io.IOException: Failed: LocatedBlock{BP-172584615-9.96.1.34-1430996280714:blk_-9223372036854775770_1002; getBlockSize()=0; corrupt=false; offset=1572864; locs=[DatanodeInfoWithStorage[127.0.0.1:52490,DS-6080b76f-adf7-45a8-aa0e-e0e82c2c1569,DISK]]}, i=6 at org.apache.hadoop.hdfs.DFSStripedOutputStream$Coordinator.putStripedBlock(DFSStripedOutputStream.java:117) at org.apache.hadoop.hdfs.StripedDataStreamer.locateFollowingBlock(StripedDataStreamer.java:120) at org.apache.hadoop.hdfs.DataStreamer.nextBlockOutputStream(DataStreamer.java:1364) at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:461) at org.apache.hadoop.hdfs.StripedDataStreamer.run(StripedDataStreamer.java:48) {noformat} h4. Issue 2: non-leading streamer calls abandonBlock(..) and get new locatedBlock from coordinator. Actually it's the last blockGroup, no more locatedBlock can poll from stripedBlocks blockingQueue. Other 8 streamer finished and closed, but this streamer hangs about 90 seconds. {noformat} 2015-05-07 19:21:25,357 INFO BlockStateChange (BlockManager.java:logAddStoredBlock(2768)) - BLOCK* addStoredBlock: blockMap updated: 127.0.0.1:51998 is added to ... 2015-05-07 19:22:55,250 WARN hdfs.DataStreamer (DataStreamer.java:run(572)) - DataStreamer Exception java.io.IOException: Failed: i=1 at org.apache.hadoop.hdfs.DFSStripedOutputStream$Coordinator.getStripedBlock(DFSStripedOutputStream.java:130) at org.apache.hadoop.hdfs.StripedDataStreamer.locateFollowingBlock(StripedDataStreamer.java:124) at org.apache.hadoop.hdfs.DataStreamer.nextBlockOutputStream(DataStreamer.java:1364) at org.apache.hadoop.hdfs.DataStreamer.run(DataStreamer.java:461) at org.apache.hadoop.hdfs.StripedDataStreamer.run(StripedDataStreamer.java:48) {noformat} h4. Issue 3: remove abandonBlock(..) RPC call for non-leading streamer -- This message was sent by Atlassian JIRA (v6.3.4#6332)