[ https://issues.apache.org/jira/browse/HADOOP-2960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Allen Wittenauer resolved HADOOP-2960.
--------------------------------------
    Resolution: Won't Fix

Closing as Won't Fix, given the -1.

> A mapper should use some heuristics to decide whether to run the combiner during spills
> ---------------------------------------------------------------------------------------
>
>                 Key: HADOOP-2960
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2960
>             Project: Hadoop Common
>          Issue Type: Bug
>            Reporter: Runping Qi
>
> Right now, the combiner, if set, will be called for each spill, no matter
> whether the combiner can actually reduce the values.
> The mapper should use some heuristics to decide whether to run the combiner
> during spills.
> One such heuristic is to check the ratio of the number of keys to
> the number of unique keys in the spill.
> The combiner will be called only if that ratio exceeds a certain threshold
> (say 2).

--
This message was sent by Atlassian JIRA
(v6.2#6252)
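
The heuristic proposed in the issue can be sketched roughly as follows. This is only an illustration, not code from Hadoop: the class CombinerHeuristic and the method shouldRunCombiner are hypothetical names, and the sketch assumes the keys of a spill are available as a list that is already sorted, so that equal keys sit next to each other.

```java
import java.util.List;

// Hypothetical helper, not part of Hadoop: decides whether running the
// combiner on a spill is likely to pay off.
final class CombinerHeuristic {

    private CombinerHeuristic() {}

    // Assumes sortedKeys holds every key in the spill, in sorted order,
    // so duplicates are adjacent and can be counted in a single pass.
    static <K> boolean shouldRunCombiner(List<K> sortedKeys, double threshold) {
        if (sortedKeys.isEmpty()) {
            return false; // nothing to combine
        }
        long uniqueKeys = 1;
        for (int i = 1; i < sortedKeys.size(); i++) {
            // A new unique key starts wherever a key differs from its predecessor.
            if (!sortedKeys.get(i).equals(sortedKeys.get(i - 1))) {
                uniqueKeys++;
            }
        }
        // Ratio of the number of keys to the number of unique keys.
        double ratio = (double) sortedKeys.size() / uniqueKeys;
        // Run the combiner only if the ratio exceeds the threshold.
        return ratio > threshold;
    }
}
```

With the threshold of 2 suggested in the issue, a spill whose keys are all distinct has a ratio of 1 and would skip the combiner, while a spill in which each key appears three times has a ratio of 3 and would run it.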