I am splitting an integer array in 2 partitions and using an accumulator  to 
sum the array. problem is

1.  I am not seeing execution time becoming half of a linear summing.

2. The second node (from looking at timestamps) takes 3 times as long as the 
first node. This gives the impression it is "waiting" for the first node to 
finish.

Hence,  I am given the impression using accumulator.sum () in the kernel and 
rdd.foreach (kernel) is making things sequential. 

Any api/setting suggestions where I could make things parallel ?


 

Reply via email to