Hi,
As a follow-up question, do map/reduce tasks run as threads or processes?
On Tue, Feb 22, 2011 at 10:35 AM, bikash sharma wrote:
> Hi,
> Is there any way in which we can measure the resource usage of each
> map/reduce task running?
> I was trying to use sar utility to track each process resou
Hi,
Is there a way to disable the use of pipelining , i.e., the reduce phase is
started only after the map phase is completed?
-bikash
On Tue, Mar 1, 2011 at 3:13 PM, bikash sharma wrote:
> Hi,
> Is there a way to disable the use of pipelining , i.e., the reduce phase is
> started only after the map phase is completed?
I think this is a hardcoded pipeline and can't be changed.
For some recent research work that addresses your q
Hi all
I met a problem when I try to balance certain hdfs directory among the
clusters. For example, I have a directory "/user/xxx/", and there 100
blocks. I want to balance them among my 5 nodes clusters. Each node has 40
blocks (2 replicas). The problem is about transfer block from one datanode
Hello,
On Tue, Mar 1, 2011 at 7:29 PM, bikash sharma wrote:
> Hi,
> As a follow-up question, do map/reduce tasks run as threads or processes?
Every launched Task runs as an independent process, communicating over
a network interface (lo) with the TaskTracker for reporting/etc.
purposes.
--
Har
Actually, that is configurable. Multiple tasks can be run from the
same JVM, and you can also use the MultithreadedMapper
(http://hadoop.apache.org/mapreduce/docs/r0.21.0/api/org/apache/hadoop/mapreduce/lib/map/MultithreadedMapper.html)
to at least make map tasks run in the same thread.
On Tue, Ma
RPC server should log the client hostname when read exception happened
--
Key: HADOOP-7159
URL: https://issues.apache.org/jira/browse/HADOOP-7159
Project: Hadoop Common
Issu