Zhao Yi Ming created HDFS-14699:
-----------------------------------

             Summary: Erasure Coding: Can NOT trigger the reconstruction when 
have the dup internal blocks and missing one internal block
                 Key: HDFS-14699
                 URL: https://issues.apache.org/jira/browse/HDFS-14699
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: ec
    Affects Versions: 3.1.1
            Reporter: Zhao Yi Ming


We tried the EC feature on an 80-node cluster running Hadoop 3.1.1 and hit the 
same scenario as you described. Could we ask when, and in which version, the 
fix can be merged? Thanks! Our testing steps follow; we hope they are helpful. 
(The DNs mentioned below are the ones holding the internal blocks under test.)
 # We defined a custom 10-2-1024k policy and applied it to a path. At this 
point we have 12 internal blocks (12 live blocks).
 # We decommissioned one DN. After the decommission completed, we have 13 
internal blocks (12 live blocks and 1 decommissioned block).
 # We then shut down one DN that did not hold the same block ID as the 
decommissioned block. Now we have 12 internal blocks (11 live blocks and 1 
decommissioned block).
 # After waiting about 600s (before the heartbeat arrives), we recommissioned 
the decommissioned DN. Now we have 12 internal blocks (11 live blocks and 1 
duplicate block).
 # EC then does not reconstruct the missing block.
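
The block counts in the steps above can be replayed with a small model. This 
is only a sketch of the bookkeeping, not the NameNode's actual logic: the 
internal block indices 0 and 5 are hypothetical (the report does not say which 
indices were involved), and the function names are made up for illustration. 
It shows that after the last step the group has 12 live replicas but only 11 
distinct internal blocks, so a check based on the replica count alone would 
not notice the missing one.

```python
from collections import Counter

DATA_UNITS = 10      # from the 10-2-1024k policy in the report
PARITY_UNITS = 2
TOTAL_UNITS = DATA_UNITS + PARITY_UNITS

DECOMMISSIONED_INDEX = 0   # hypothetical: index held by the decommissioned DN
SHUT_DOWN_INDEX = 5        # hypothetical: index held by the DN that was shut down


def summarize(live_indices):
    """Summarize a block group given the internal block index of each live replica."""
    counts = Counter(live_indices)
    return {
        "live_replicas": len(live_indices),
        "distinct_indices": len(counts),
        "missing": sorted(set(range(TOTAL_UNITS)) - set(counts)),
        "duplicated": sorted(i for i, n in counts.items() if n > 1),
    }


def needs_reconstruction(live_indices):
    """True when at least one internal block index has no live replica."""
    return len(set(live_indices)) < TOTAL_UNITS


# Step 1: 12 internal blocks, all live.
live = list(range(TOTAL_UNITS))
decommissioned = []

# Step 2: one DN is decommissioned; its block is copied to another DN,
# so there are still 12 live replicas plus 1 decommissioned replica.
decommissioned.append(DECOMMISSIONED_INDEX)
assert len(live) + len(decommissioned) == 13

# Step 3: a DN holding a different index is shut down: 11 live + 1 decommissioned.
live.remove(SHUT_DOWN_INDEX)
assert len(live) + len(decommissioned) == 12

# Step 4: the decommissioned DN is recommissioned; its replica is live again
# and duplicates an index that already has a live copy.
live.extend(decommissioned)
decommissioned.clear()

final = summarize(live)
print(final)
print("needs reconstruction:", needs_reconstruction(live))
```

Counting by distinct index rather than by number of replicas is what 
separates "12 replicas present" from "all 12 internal blocks present" in this 
model.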

We think this is a critical issue for using the EC feature in a production 
environment. Could you help? Thanks a lot!



