Re: replicate() method in BlockManager.scala choosing only one node for replication.

2014-09-12 Thread Kartheek.R
When I look at the storage details of the RDD in the web UI, I find that each block is replicated twice, and the replicas are not all on a single node. Every node in the cluster hosts some block or other. Why the difference? The trace of the replicate() method shows only one node, but the web UI shows multiple nod

replicate() method in BlockManager.scala choosing only one node for replication.

2014-09-11 Thread rapelly kartheek
Hi, I just wanted to see how nodes get allocated for RDD replication. I see that all the blocks are being replicated to the same node. I was expecting each block to be replicated across different nodes. I have a humble three-node Spark cluster :). Below is the trace of replicate()
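
The two observations in this thread (a replicate() trace that names only one node, and a web UI that shows blocks spread over every node) need not contradict each other. The sketch below is a toy model only, not Spark's BlockManager code: it assumes, hypothetically, that each node sends all of its own blocks to one fixed peer. The function replicate_blocks, the "next node in the list" peer rule, the node names and the block ids are all invented for illustration.

```python
from collections import defaultdict


def replicate_blocks(nodes, blocks_by_origin):
    """Toy model: each node copies all of its own blocks to one fixed peer.

    ASSUMPTION: the peer rule (next node in the list, wrapping around) is
    hypothetical and is not taken from BlockManager.scala.
    """
    peer_of = {n: nodes[(i + 1) % len(nodes)] for i, n in enumerate(nodes)}
    hosted = defaultdict(set)
    for origin, blocks in blocks_by_origin.items():
        for block in blocks:
            hosted[block].add(origin)           # original copy
            hosted[block].add(peer_of[origin])  # replica on the chosen peer
    return peer_of, dict(hosted)


# Hypothetical three-node cluster with two cached blocks per node.
nodes = ["node1", "node2", "node3"]
blocks_by_origin = {
    "node1": ["rdd_0_0", "rdd_0_1"],
    "node2": ["rdd_0_2", "rdd_0_3"],
    "node3": ["rdd_0_4", "rdd_0_5"],
}

peer_of, hosted = replicate_blocks(nodes, blocks_by_origin)

# View from a single node, comparable to tracing replicate() in one process:
print("node1 replicates to:", peer_of["node1"])

# Cluster-wide view, comparable to a storage page listing every block:
for block in sorted(hosted):
    print(block, "->", sorted(hosted[block]))
```

In this model a trace taken on one node always reports the same single target, while the cluster-wide listing shows every block on two nodes and every node hosting something. Whether that is what actually happens in the cluster described here would have to be checked against the full trace.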