Hi

I am not familiar with Azkaban and probably a better question to the
Azkaban community IMO. But there seems to be two modes (
http://azkaban.github.io/azkaban/docs/2.5/) one is solo and one is
two-server mode, but either way I think still SPOF? If there is no
election, just based on process, my 2 cents would be monitor, alert, and
start the process somewhere else. Better yet, don't install the process on
Cassandra node. Keep your instance for one purpose only. If you run cloud
like AWS you will be able to autoscale min1 max1 easily.


Note: In peer-to-peer architecture, there is simply no concept of master.
You can start with some seed nodes for discovery. It depends how you design
discovery.

On Sat, Aug 15, 2015 at 11:49 AM, Vikram Kone <vikramk...@gmail.com> wrote:

> Hi,
> We are planning to install Azkaban in solo server mode on a 24
> node cassandra cluster to be able to schedule spark jobs with intricate
> dependency chain. The problem, is since Cassandra has a no-SPOF
> architecture ie any node can become the master for the cluster, it creates
> the problem for Azkaban master since it's not a peer-peer architecture
> where any node can become the master. Only a single mode has to be master
> at any given time.
>
> What are our options here? Are there any framworks or tools out there that
> would allow any application to run on a cluster of machines with high
> availablity?
> Should I be looking at something like zookeeper for this ? Or Mesos may
> be?

Reply via email to