Hey.. I am trying to write a small mapreduce program. I launch a few map
tasks, each of which should complete within a certain time (say 5
minutes)... all the tasks complete within 5 minutes except the last one -
which takes around 10 times more the time taken by all other map
tasks.any idea w
Basically, it finishes what it is supposed to do (I view the logs to find
out), but does not move onto the reduce stage.
Ted Dunning wrote:
>
> Is that last map task actually running, or is it pending?
>
> On Tue, Sep 29, 2009 at 5:57 PM, achilles852 wrote:
>
>>
&g
n your map tasks? It might be helpful if you paste the
> logs here after blanking your system specific info., as then one can find
> out where till the reduce it went or if the copy phase started at all.
>
> Thanks,
> Amogh
>
> -Original Message-
> From: achil
sekar-2 wrote:
>
> Hi,
> Can you provide info on the input like compression etc? Also, are you
> using cached files in your map tasks? It might be helpful if you paste the
> logs here after blanking your system specific info., as then one can find
> out where till the reduce it wen
didn't spend the time to know for sure)
>
> thanks,
> thushara
>
> On Wed, Sep 30, 2009 at 1:46 AM, achilles852 wrote:
>
>>
>> Running Hadoop on EC2 using 32 bit small instances with 5 slaves. The job
>> has
>> 5 map tasks and 1 reduce task. Each ma