I think what MaGuoWei is looking for is "intra-job" YARN elasticity.

Adding these hooks will also be important for triggering streaming jobs
to scale in/out at runtime.
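
For illustration only, here is a minimal sketch of what get/release/deploy
hooks of the kind discussed in this thread might look like. Every name in it
(SlotPool, request, deploy, finish, release_idle) is hypothetical and is not
part of Flink's scheduler or of YARN; Python is used only to keep the sketch
short, and the "cluster" is just an in-memory counter.

```python
class SlotPool:
    """Hypothetical slot pool; NOT Flink's scheduler API."""

    def __init__(self, capacity):
        self.capacity = capacity  # assumed upper bound the resource manager grants
        self.held = set()         # slot ids currently held by this pool
        self.busy = {}            # slot id -> name of the task deployed on it
        self._next_id = 0         # next fresh slot id to hand out

    def request(self, n):
        """'get' hook: acquire up to n new slots and return their ids."""
        granted = []
        while len(granted) < n and len(self.held) < self.capacity:
            slot = self._next_id
            self._next_id += 1
            self.held.add(slot)
            granted.append(slot)
        return granted

    def deploy(self, slot, task):
        """'deploy' hook: place a task on a held, idle slot."""
        if slot not in self.held:
            raise KeyError(f"slot {slot} is not held")
        if slot in self.busy:
            raise ValueError(f"slot {slot} is already running {self.busy[slot]}")
        self.busy[slot] = task

    def finish(self, slot):
        """Mark the task on a slot as done, making the slot idle again."""
        self.busy.pop(slot, None)

    def release_idle(self):
        """'release' hook: give back every held slot that runs no task."""
        idle = self.held - set(self.busy)
        self.held -= idle
        return sorted(idle)


pool = SlotPool(capacity=4)
slots = pool.request(3)        # acquire three slots
pool.deploy(slots[0], "map")   # only one of them gets work
print(pool.release_idle())     # idle slots go back while the job keeps running
pool.finish(slots[0])
print(pool.release_idle())     # the last slot is returned once its task is done
```

The point of the sketch is that release only ever touches idle slots, so
giving resources back would not fail a running job; how that would map onto
YARN containers is left open here.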

On Tue, Jul 28, 2015 at 3:27 AM, MaGuoWei <maguo...@outlook.com> wrote:

> Thanks, all of you. Now I know I can achieve this goal by creating a
> cluster per topology and estimating the cluster size by analyzing the
> JobGraph (or from some user input). But I think it would be more elegant if
> Flink abstracted a common resource API such as get/release/deploy. :-)
> Thanks.
>
>
>
> > From: rmetz...@apache.org
> > Date: Mon, 27 Jul 2015 14:01:11 +0200
> > Subject: Re: add some new api to the scheduler in the job manager
> > To: dev@flink.apache.org
> >
> > Hi MaGuoWei,
> >
> > would you like this to be done automatically by Flink or based on some user
> > input?
> > Adding commands to ./bin/yarn-session.sh to change the cluster size is
> > quite easy. However, reducing the cluster size while a job is running will
> > fail the job.
> >
> > Doing this automatically is much harder.
> >
> > On Mon, Jul 27, 2015 at 12:07 PM, Maximilian Michels <m...@apache.org> wrote:
> >
> > > Hi MaGuoWei,
> > >
> > > If I understand correctly, you are looking for a way to have a job
> > > manager (master) continuously running which requests YARN containers for
> > > the task managers (workers) on the fly. This is currently not supported by
> > > Flink, although you can add or remove task managers while the cluster is
> > > running. The job manager distributes tasks to the task managers that are
> > > available at scheduling time.
> > >
> > > As Aljoscha mentioned, the most flexible way of sharing resources in a YARN
> > > environment is to start a per-job cluster for each job.
> > >
> > > Cheers,
> > > Max
> > >
> > > On Mon, Jul 27, 2015 at 11:19 AM, MaGuoWei <maguo...@outlook.com> wrote:
> > >
> > > > That's great! Is there any documentation? I am very interested in this.
> > > > Thanks.
> > > >
> > > >
> > > > > From: aljos...@apache.org
> > > > > Date: Mon, 27 Jul 2015 05:14:00 +0000
> > > > > Subject: Re: add some new api to the scheduler in the job manager
> > > > > To: dev@flink.apache.org
> > > > >
> > > > > Hi,
> > > > > I think for more details on giving back resources of a running cluster we
> > > > > have to wait for Robert's opinion. In the meantime, you can also just run
> > > > > a single job that will bring up some YARN containers and then release them
> > > > > afterward using this:
> > > > >
> > > > > https://ci.apache.org/projects/flink/flink-docs-release-0.9/setup/yarn_setup.html#run-a-single-flink-job-on-hadoop-yarn
> > > > >
> > > > > Cheers,
> > > > > Aljoscha
> > > > >
> > > > > On Mon, 27 Jul 2015 at 03:51 MaGuoWei <maguo...@outlook.com> wrote:
> > > > >
> > > > > > Hi guys,
> > > > > > Flink must keep all the resources all the time. That sometimes wastes
> > > > > > resources, especially in a shared cluster. For example, when using
> > > > > > Flink on YARN, the resources can't be returned to the RM even if no
> > > > > > graph is running. So I want to know: is it possible to add some APIs to
> > > > > > the scheduler to request a resource (slot) and release a resource (slot)?
> > > > > > These APIs could be customized for different circumstances.
> > > > > > Thanks a lot!
> > > > > >
> > > > > >
> > > > > >
