Thank you for grateful
I know I can start spark-shell by launching the shell itself
spark-shell
Now I know that in standalone mode I can also connect to master
spark-shell --master spark://<HOST>:7077
My point is what are the differences between these two start-up modes for
spark-shell? If I start spark-shell and connect to master what performance gain
will I get if any or it does not matter. Is it the same as for spark-submit
regards
On Saturday, 11 June 2016, 19:39, Mohammad Tariq <[email protected]> wrote:
Hi Ashok,
In local mode all the processes run inside a single jvm, whereas in standalone
mode we have separate master and worker processes running in their own jvms.
To quickly test your code from within your IDE you could probable use the local
mode. However, to get a real feel of how Spark operates I would suggest you to
have a standalone setup as well. It's just the matter of launching a standalone
cluster either manually(by starting a master and workers by hand), or by using
the launch scripts provided with Spark package.
You can find more on this here.
HTH
| |
|
| Tariq, Mohammad
| about.me/mti |
|
| |
|
| |
On Sat, Jun 11, 2016 at 11:38 PM, Ashok Kumar <[email protected]>
wrote:
Hi,
What is the difference between running Spark in Local mode or standalone mode?
Are they the same. If they are not which is best suited for non prod work.
I am also aware that one can run Spark in Yarn mode as well.
Thanks