There are folks who run Kafka brokers on Apache Mesos.  I don't know of
anyone running Kafka brokers on YARN, but if there are, I hope they
chime in.

Without getting into a long debate about Mesos vs. YARN, I do agree that
cluster resource allocation is an important direction for the industry
as a whole.  There was a discussion about this on the dev list some time
back (maybe last November), and there isn't really anything that Kafka
needs to do to make this work.  I had started that discussion and a wiki
page, and at the end of it I was able to get Kafka running without any
changes to it.  Each of these resource negotiator systems provides ways for
standalone systems to run on it, so that custom schedulers don't have to be
developed.  Their (Mesos/YARN/etc.) success is somewhat dictated by not
forcing applications to do much of anything to gain the benefits they
provide =8^)

If you are interested in running Kafka brokers on Mesos, you can do so
using the Apache Aurora scheduler.  Some scripts that will launch that for
you on a Mesos cluster can be found here:
https://github.com/pegasussolutions/borealis (along with the ZooKeeper
ensemble, if you want it).  I know folks also use Marathon (another Mesos
scheduler) for this, but I don't know whether those scripts were open
sourced, though it is possible.
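
To give a rough idea of what launching a standalone broker through Marathon
could look like, here is a minimal sketch in Python that builds an app
definition.  The keys (id, cmd, cpus, mem, instances) are Marathon app
definition fields; the app id, the /opt/kafka install path and the resource
numbers are hypothetical placeholders and are not taken from any of the
scripts mentioned above.

```python
import json


def kafka_broker_app(app_id="/kafka-broker", kafka_home="/opt/kafka",
                     cpus=1.0, mem_mb=2048, instances=1):
    """Build a Marathon-style app definition for one standalone Kafka broker.

    Every default here is an assumed placeholder; adjust for your cluster.
    """
    return {
        "id": app_id,
        # kafka-server-start.sh and config/server.properties ship with Kafka;
        # kafka_home is an assumed install location on each Mesos node.
        "cmd": f"{kafka_home}/bin/kafka-server-start.sh "
               f"{kafka_home}/config/server.properties",
        "cpus": cpus,        # CPU share requested from Mesos
        "mem": mem_mb,       # memory in MB requested from Mesos
        "instances": instances,
    }


if __name__ == "__main__":
    # Print the JSON document that would be submitted to Marathon.
    print(json.dumps(kafka_broker_app(), indent=2))
```

The printed JSON is the kind of document you would POST to Marathon's
/v2/apps endpoint.  Note that each broker needs its own broker.id, so simply
raising instances with one shared server.properties would not be enough.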

Running consumers on Mesos and/or YARN has become more prevalent (almost
typical) now that systems like Spark, Samza and Storm handle the data
processing from the brokers and are themselves launched on Mesos and/or
YARN clusters.

The same goes for producers, but that is more a matter of running whatever
application produces the data on Mesos and/or YARN and producing to the
Kafka brokers from within that application.

/*******************************************
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoop <http://www.twitter.com/allthingshadoop>
********************************************/


On Wed, Jul 23, 2014 at 5:37 PM, hsy...@gmail.com <hsy...@gmail.com> wrote:

> Hi guys,
>
> Kafka is getting more and more popular, and in most cases people run Kafka
> as a long-running service in the cluster. Has there been a discussion of
> running Kafka on a YARN cluster, so that we can use its convenient
> configuration/resource management and HA?  I think there is big potential
> and demand for that.
> I found a project, https://github.com/kkasravi/kafka-yarn. But is there an
> official roadmap/plan for this?
>
> Thank you very much!
>
> Best,
> Siyuan
>
