I am trying to query a huge file with 370 blocks, but it errors out with message of "number of mappers exceeds limit" and my cluster has a "mapred.tasktracker.map.tasks.maximum" set to 50.
I have tried to set parameters such as hive.exec.mappers.max/ mapred.tasktracker.tasks/ apred.tasktracker.map.tasks.maximum through beeswax and seems none of them is effective. I can change "mapred.tasktracker.map.tasks.maximum" and the query can go through, but I really want to limit concurrent number of tasks per job. So any suggestions please? I am running cloudera 4.5.