I'm not familiar with Infiniband, but I can chime in on the Spark part.

There are two kinds of communications in Spark: control plane and data
plane.  Task scheduling / dispatching is control, whereas fetching a block
(e.g. shuffle) is data.


On Tue, Sep 16, 2014 at 4:22 PM, Trident <cw...@vip.qq.com> wrote:

> Thank you for reading this mail.
>
> I'm trying to change the underlying network connection system of Spark to
> support Infiniteband.
>
> 1. I doubt whether ConnectionManager and netty is under construction. It
> seems that they are not usually used.
>

They are used for data plane communication. Broadcast, shuffle, all use
them.



> 2. How much connection payload is carried by akka?
>

Akka is mainly responsible for control, i.e. dispatching tasks, reporting a
block being put into memory to the driver etc.



> 3. When running ./bin/run-example SparkPi   I noticed that the jar file
> has been sent from server to client. It is scary because the jar is big. Is
> it common?
>

How are you going to distribute the jar file if you don't send it? The
workers need to bytecode for those classes you are going to execute.

Reply via email to