How are you concluding that the intermediate output is compressed from
the map, but not in the reduce? -C
my hadoop-site.xml
<property>
<name>mapred.compress.map.output</name>
<value>true</value>
<description>Should the job outputs be compressed?
</description>
</property>
<property>
<name>mapred.output.compression.type</name>
<value>BLOCK</value>
<description>If the job outputs are to compressed as SequenceFiles, how
should
they be compressed? Should be one of NONE, RECORD or BLOCK.
</description>
</property>
from the job.xml
mapred.output.compress = false // final output
mapred.compress.map.output = true // map output
+ I can head the files from comand line and read the key / value in the
reduce intermediate merges but not the map.out files.