Wrong description of Block-Compressed SequenceFile Format in SequenceFile's javadoc -----------------------------------------------------------------------------------
Key: HADOOP-7048 URL: https://issues.apache.org/jira/browse/HADOOP-7048 Project: Hadoop Common Issue Type: Improvement Components: io Affects Versions: 0.21.0 Reporter: Jingguo Yao Priority: Minor Here is the following description for Block-Compressed SequenceFile Format in SequenceFile's javadoc: * <li> * Record <i>Block</i> * <ul> * <li>Compressed key-lengths block-size</li> * <li>Compressed key-lengths block</li> * <li>Compressed keys block-size</li> * <li>Compressed keys block</li> * <li>Compressed value-lengths block-size</li> * <li>Compressed value-lengths block</li> * <li>Compressed values block-size</li> * <li>Compressed values block</li> * </ul> * </li> * <li> * A sync-marker every few <code>100</code> bytes or so. * </li> This description misses "Uncompressed record number in the block". And "A sync-marker every few <code>100</code> bytes or so" is not the case for Block-Compressed SequenceFile Format. Correct description should be: * <li> * Record <i>Block</i> * <ul> * <li>Uncompressed record number in the block</li> * <li>Compressed key-lengths block-size</li> * <li>Compressed key-lengths block</li> * <li>Compressed keys block-size</li> * <li>Compressed keys block</li> * <li>Compressed value-lengths block-size</li> * <li>Compressed value-lengths block</li> * <li>Compressed values block-size</li> * <li>Compressed values block</li> * </ul> * </li> * <li> * A sync-marker every block. * </li> -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.