[ https://issues.apache.org/jira/browse/HDFS-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tsz Wo (Nicholas), SZE resolved HDFS-1103. ------------------------------------------ Resolution: Invalid As explained by Hairong, this is invalid. Please feel free to reopen if you think it is not. Resolving. > Replica recovery doesn't distinguish between flushed-but-corrupted last chunk > and unflushed last chunk > ------------------------------------------------------------------------------------------------------ > > Key: HDFS-1103 > URL: https://issues.apache.org/jira/browse/HDFS-1103 > Project: Hadoop HDFS > Issue Type: Bug > Components: data-node > Affects Versions: 0.21.0, 0.22.0, 0.23.0, 0.24.0 > Reporter: Todd Lipcon > Priority: Blocker > Attachments: hdfs-1103-test.txt > > > When the DN creates a replica under recovery, it calls validateIntegrity, > which truncates the last checksum chunk off of a replica if it is found to be > invalid. Then when the block recovery process happens, this shortened block > wins over a longer replica from another node where there was no corruption. > Thus, if just one of the DNs has an invalid last checksum chunk, data that > has been sync()ed to other datanodes can be lost. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira