Hi, For HDFS-7281, we are making a change in fsck report:
Before the change: CORRUPT FILES: 29 MISSING BLOCKS: 29 MISSING SIZE: 576920501 B CORRUPT BLOCKS: 29 ... Default replication factor: 3 Average block replication: 2.7412367 Corrupt blocks: 0 Missing replicas: 0 (0.0 %) Number of data-nodes: 12 Number of racks: 1 FSCK ended at Tue Apr 29 12:18:02 EDT 2014 in 48507 milliseconds ... After the change: CORRUPT FILES: 29 MISSING BLOCKS: 29 MISSING SIZE: 576920501 B CORRUPT BLOCKS: 29 *CORRUPT SIZE: <zyx> B* ... Default replication factor: 3 Average block replication: 2.7412367 * Missing blocks: <xyz>* Corrupt blocks: 0 Missing replicas: 0 (0.0 %) Number of data-nodes: 12 Number of racks: 1 FSCK ended at Tue Apr 29 12:18:02 EDT 2014 in 48507 milliseconds ... Basically we are adding two additional lines to the report (as highlighted above). Theoretically if a tool parses existing fsck report and expects the 'Corrupt blocks" entry to be right after the "Average block replication" entry, then the change would fail the tool. But is this really a concern? I guess this is not really a concern, so I don't think this change is incompatible. but would anyone please comment? Thanks. BTW, thanks Ming Ma for reporting and working on this issue. --Yongjun