Prateek Gaur created HDFS-7650: ---------------------------------- Summary: Snapshot Corruption for OPENFORWRITE files Key: HDFS-7650 URL: https://issues.apache.org/jira/browse/HDFS-7650 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.5.1 Reporter: Prateek Gaur
I have encountered snapshot corruption issue when creating a snapshot while some files were open for being written. Attached are the sample config and program that I have written that try to reproduce the issue. After taking the snapshot : the OPENFORWRITE file is shown as corrupted with size 0 and 1 block missing. Output of fsck : Command: path/to/hadoop-2.5.1/bin/hdfs fsck -conf /tmp/hdfs_config.UKhJXu -includeSnapshots /.snapshot/test_snapshot -files exit_code 256 Output: FSCK started by prateek (auth:SIMPLE) from /127.0.0.1 for path /.snapshot/test_snapshot at Tue Jan 20 17:34:07 PST 2015 /.snapshot/test_snapshot <dir> /.snapshot/test_snapshot/b <dir> /.snapshot/test_snapshot/b/c 0 bytes, 1 block(s): MISSING 1 blocks of total size 0 B Status: CORRUPT Total size: 0 B Total dirs: 2 Total files: 1 Total symlinks: 0 Total blocks (validated): 1 (avg. block size 0 B) ******************************** CORRUPT FILES: 1 ******************************** Minimally replicated blocks: 0 (0.0 %) Over-replicated blocks: 0 (0.0 %) Under-replicated blocks: 0 (0.0 %) Mis-replicated blocks: 0 (0.0 %) Default replication factor: 3 Average block replication: 0.0 Corrupt blocks: 0 Missing replicas: 0 Number of data-nodes: 1 Number of racks: 1 FSCK ended at Tue Jan 20 17:34:07 PST 2015 in 3 milliseconds The filesystem under path '/.snapshot/test_snapshot' is CORRUPT -- This message was sent by Atlassian JIRA (v6.3.4#6332)