I think you may be referring to Spark Survey 2015. According to that survey, 
48% use standalone, 40% use YARN and only 11% use Mesos (the numbers don’t add 
up to 100 – probably because of rounding error).

Mohammed
Author: Big Data Analytics with 
Spark<http://www.amazon.com/Big-Data-Analytics-Spark-Practitioners/dp/1484209656/>

From: Igor Berman [mailto:igor.ber...@gmail.com]
Sent: Friday, February 26, 2016 3:52 AM
To: Petr Novak
Cc: user
Subject: Re: Standalone vs. Mesos for production installation on a smallish 
cluster

Imho most of production clusters are standalone
there was some presentation from spark summit with some stats inside(can't find 
right now), so standalone was at 1st place
it was from Matei
https://databricks.com/resources/slides

On 26 February 2016 at 13:40, Petr Novak 
<oss.mli...@gmail.com<mailto:oss.mli...@gmail.com>> wrote:
Hi all,
I believe that it used to be in documentation that Standalone mode is not for 
production. I'm either wrong or it was already removed.

Having a small cluster between 5-10 nodes is Standalone recommended for 
production? I would like to go with Mesos but the question is if there is real 
add-on value for production, mainly from stability perspective.

Can I expect that adding Mesos will improve stability compared to Standalone to 
the extent to justify itself according to somewhat increased complexity?

I know it is hard to answer because Mesos layer itself is going to add some 
bugs as well.

Are there unique features enabled by Mesos specific to Spark? E.g. adaptive 
resources for jobs or whatever?

In the future once cluster will grow and more services running on Mesos, we 
plan to use Mesos. The question is if it does worth to go with it immediately 
even maybe its utility is not directly needed at this point.

Many thanks,
Petr

Reply via email to