How are you concluding that the intermediate output is compressed from the map, but not in the reduce? -C

My hadoop-site.xml:

<property>
 <name>mapred.compress.map.output</name>
 <value>true</value>
 <description>Should the job outputs be compressed?
 </description>
</property>
<property>
 <name>mapred.output.compression.type</name>
 <value>BLOCK</value>
 <description>If the job outputs are to be compressed as SequenceFiles, how should
              they be compressed? Should be one of NONE, RECORD or BLOCK.
 </description>
</property>


From the job.xml:

mapred.output.compress = false // final output
mapred.compress.map.output = true // map output

Also, I can run head on the files from the command line and read the keys and values in the reduce-side intermediate merge files, but not in the map.out files.
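
The head check described above could be approximated in code along these lines. This is only a rough sketch: it assumes compressed data shows up as mostly non-printable bytes, and the function names, the 512-byte read size, and the 0.9 threshold are hypothetical choices, not anything taken from Hadoop.

```python
import string

# Byte values that count as "readable" (ASCII letters, digits,
# punctuation, and whitespace).
PRINTABLE = set(string.printable.encode("ascii"))


def looks_like_plain_text(data: bytes, threshold: float = 0.9) -> bool:
    """Return True if at least `threshold` of the bytes are printable ASCII.

    Heuristic only: compressed or binary data usually contains many
    non-printable bytes, while uncompressed text records do not.
    """
    if not data:
        return False
    printable = sum(1 for b in data if b in PRINTABLE)
    return printable / len(data) >= threshold


def head_looks_readable(path: str, nbytes: int = 512) -> bool:
    """Read the first `nbytes` of a local file and apply the heuristic."""
    with open(path, "rb") as f:
        return looks_like_plain_text(f.read(nbytes))
```

Calling head_looks_readable on a local copy of a map.out file and on a reduce-side merge file would give a quick yes/no version of the same comparison, though a binary but uncompressed file format would also come back as unreadable.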

