I saw this in the Hadoop wiki: http://wiki.apache.org/hadoop/HowManyMapsAndReduces
But in my experiment I see a different result. I set the input format to CombineHiveInputFormat in Hive, and according to the doc the default block size should be 64 MB. My input files are larger than 64 MB, yet Hadoop still created only one map task to handle all the data. Can you help me figure out what is wrong?
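
To make my expectation concrete, here is a rough sketch of the arithmetic I had in mind. It assumes the naive rule of one map task per 64 MB block of each input file, which may not be how CombineHiveInputFormat actually builds its splits. The function name and the 200 MB file size are made up for illustration and are not my real data.

```python
import math

# Assumption: 64 MB default block size, as I understood it from the doc.
BLOCK_SIZE_MB = 64


def expected_map_tasks(file_sizes_mb, block_size_mb=BLOCK_SIZE_MB):
    """Naive expectation: every file is cut into block-sized splits,
    and each split gets its own map task."""
    return sum(
        math.ceil(size / block_size_mb)
        for size in file_sizes_mb
        if size > 0  # empty files contribute no splits in this sketch
    )


if __name__ == "__main__":
    # Hypothetical single 200 MB input file.
    print(expected_map_tasks([200]))
```

By this reasoning I expected several map tasks rather than one, which is why the single map task surprised me.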