Re: Sharded Service Coordination

Joe Stein Fri, 11 Jul 2014 14:25:30 -0700

+1 for adding it to Aurora, looking forward to using it, thanks!!!!

/*******************************************
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoop <http://www.twitter.com/allthingshadoop>
********************************************/



On Wed, Jul 9, 2014 at 6:48 PM, Oliver, James <james.oli...@pegs.com> wrote:

> *Nudge nudge*
>
> We would love to take advantage of this tool if you folks would be so kind!
>
> Thanks,
> James
>
>
> On 7/9/14 1:35 PM, "Bill Farner" <wfar...@apache.org> wrote:
>
> >FWIW at Twitter we do something that sounds like a mix between (1) and
> >(2).
> > We consume the (otherwise unused) Announcer configuration parameter [1]
> >to
> >Job, and extend the executor to publish all allocated {{thermos.port[x]}}s
> >to ZooKeeper.  This is something we would love to open source, so please
> >nudge us to do so if you would like this feature!
> >
> >[1]
> >
> https://github.com/apache/incubator-aurora/blob/master/src/main/python/apa
> >che/aurora/config/schema/base.py#L69
> >
> >-=Bill
> >
> >
> >On Wed, Jul 9, 2014 at 10:24 AM, Oliver, James <james.oli...@pegs.com>
> >wrote:
> >
> >> Good morning,
> >>
> >> My company is in the process of adopting Apache's open source stack, and
> >> I've been tasked with building Aurora jobs to deploy a few popular open
> >> source technologies on Mesos. Aurora is an elegant scheduler and in our
> >> estimation will met our company's needs. However, we are struggling to
> >>meet
> >> some of the configuration requirements of some of the tools we wish to
> >> deploy.
> >>
> >> Scenario: When a distributed service is deployed, we need to
> >> programmatically determine all hosts selected for deployment and their
> >> reserved ports in order to properly configure the service. We've solved
> >> this in a few not-so-elegant ways:
> >>
> >>  1.  We wrote a Process to publish host/port information to a
> >>distributed
> >> file system, block until {{instances}} were written, read the
> >>information
> >> and finally configure the service. This works, but IMO it is not an
> >>elegant
> >> solution.
> >>  2.  Next, we designed a REST API for service registration (based off of
> >> the Aurora job key) and published this information to our ZooKeeper
> >> ensemble. This solution removes the dependency on a pre-configured
> >> distributed file system. However, some overhead is still required
> >>(multiple
> >> instances are necessary so as to not introduce a point of failure).
> >>Aurora
> >> jobs now require some initial configuration to be able to communicate
> >>with
> >> the service. Communication to this API is a little non-trivial because
> >>the
> >> REST service doesn't block until all information is communicated  this
> >> would be problematic due to the nature of the HTTP protocol.
> >>
> >> At this point, we realized that an even better solution might be to
> >> communicate directly to Aurora scheduler to get this data. Seeing as
> >>Aurora
> >> scripts are just Python, we could probably implement it in a reusable
> >> fashionŠbut I'm curious if anyone has already gone down this path?
> >>
> >> Thank you,
> >> James O
> >>
>
>

Re: Sharded Service Coordination

Reply via email to