Re: Map Reduce in heterogeneous environ..

Steve Loughran Thu, 11 Mar 2010 04:26:07 -0800

abhishek sharma wrote:

No. of slots per task tracker cannot be varied so even if some nodes
have additional cores, extra slots cannot be added.


True. This is what I have been wishing for;-) I routinely use clusters
where some machines have 8 while others have 4 cores.

Varying the #of task slots per node is trivial. Every TT reports the #ofavaialable slots. Therefore you need a separate config file for everyclass of node in your cluster, set themapred.tasktracker.map.tasks.maximum andmapred.tasktracker.reduce.tasks.maximum values to the limits for thosemachines, push out the right config file to the right target machines.

If you don't have a way of providing different configurations todifferent machines in your cluster, the problem lies with yourconfiguration management tooling/policy, not Hadoop.

What we dont have (today) is the ability of a live TT to vary its slotsbased on other system information, so if the machine is also acceptingworkloads from some grid scheduler the TT can't look at the number oflive grid jobs or the IO load and use that to reduce its slot count.Contributions there would be welcomed by those people that share computenodes on different workloads.


-steve

Re: Map Reduce in heterogeneous environ..

Reply via email to