Hi, Till:
Issue opened here: https://issues.apache.org/jira/browse/FLINK-9077

On Tue, Mar 27, 2018 at 11:31 PM Till Rohrmann <trohrm...@apache.org> wrote:

> Hi Renjie,
>
> we couldn't take a look at your configuration yet, because the ML filters
> attachments out. You could upload your log to https://gist.github.com/ or
> open a JIRA to which you attach the log.
>
> Cheers,
> Till
>
> On Sun, Mar 25, 2018 at 9:04 AM, Renjie Liu <liurenjie2...@gmail.com>
> wrote:
>
> > Hi:
> > Could you please help check whether there is any mistake in the config? If
> > not, I'll file a bug in JIRA.
> >
> > On Fri, Mar 23, 2018 at 7:16 PM, Ted Yu <yuzhih...@gmail.com> wrote:
> >
> > > Renjie: The log didn't go through.
> > > Consider logging a JIRA and attaching the log there.
> > > Thanks
> > >
> > > -------- Original message --------
> > > From: Renjie Liu <liurenjie2...@gmail.com>
> > > Date: 3/23/18 1:38 AM (GMT-08:00)
> > > To: dev@flink.apache.org
> > > Subject: Re: Flip 6 mesos support
> > >
> > > Hi, Till:
> > > Attached is my log.
> > > I'm also looking into this, could you please assign this bug to me? I'm
> > > also trying to contribute to Flink.
> > >
> > > On Fri, Mar 23, 2018 at 4:11 PM Till Rohrmann <trohrm...@apache.org>
> > > wrote:
> > > Hi Renjie, could you share the logs with us? This sounds like a bug we
> > > should fix.
> > >
> > >
> > >
> > > Cheers,
> > >
> > > Till
> > >
> > >
> > >
> > > On Fri, Mar 23, 2018 at 4:42 AM, Renjie Liu <liurenjie2...@gmail.com>
> > > wrote:
> > >
> > >
> > >
> > > > Hi, Till:
> > >
> > > > Has anybody succeeded in deploying Flip-6 mode on Mesos?
> > >
> > > >
> > >
> > > > I'm testing Flip-6 using the master branch and I just can't run jobs.
> > > > The following is my configuration:
> > >
> > > >
> > >
> > > > jobmanager.rpc.address: qt9ss.prod.mediav.com
> > > > jobmanager.rpc.port: 6123
> > > > jobmanager.heap.mb: 1024
> > > > taskmanager.heap.mb: 1024
> > > > taskmanager.numberOfTaskSlots: 5
> > > > parallelism.default: 1
> > > > web.port: 8081
> > > > mesos.master: zk://dk71ss.jx.shbt2.qihoo.net:2191,dk72ss.jx.shbt2.qihoo.net:2191,dk5ss.jx.shbt2.qihoo.net:2191/mesos
> > > > mesos.resourcemanager.tasks.container.type: docker
> > > > mesos.resourcemanager.tasks.container.image.name: dk1ss.prod.mediav.com:5000/adq/flink:1.6.0-SNAPSHOT
> > > > mesos.resourcemanager.framework.user: mediav
> > > > mesos.resourcemanager.tasks.cpus: 5
> > > > mesos.resourcemanager.tasks.mem: 10240
> > > > mesos.resourcemanager.framework.name: Flink
> > > > mesos.failover-timeout: 60
> > >
> > > >
> > >
> > > > From the Mesos side, I can see that when I submit a job, the Flink master
> > > > requests a container with 5 cores. But the job submission still fails with
> > > > the following error:
> > >
> > > > org.apache.flink.runtime.jobmanager.scheduler.NoResourceAvailableException:
> > > > Could not allocate all requires slots within timeout of 300000 ms. Slots
> > > > required: 1, slots allocated: 0
> > >
> > > >
> > >
> > > > My job only requires 1 slot, but the job manager keeps reporting that no
> > > > slots are available.
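A quick way to double-check settings like the ones quoted above is to parse them and compare the per-container numbers. This is only a minimal sketch in Python: the helper names `parse_flat_config` and `summarize` are made up for illustration and are not part of Flink, the sample repeats a subset of the quoted keys, and the two checks (CPUs per slot, heap no larger than container memory) are assumptions about what is worth comparing, not Flink's own validation logic.

```python
# Hypothetical helpers for illustration only; not part of Flink.

SAMPLE = """\
jobmanager.rpc.port: 6123
taskmanager.heap.mb: 1024
taskmanager.numberOfTaskSlots: 5
mesos.resourcemanager.tasks.cpus: 5
mesos.resourcemanager.tasks.mem: 10240
"""


def parse_flat_config(text):
    """Parse flat `key: value` lines into a dict of strings.

    Blank lines and lines starting with `#` are skipped. Only the first
    colon separates key from value, so values such as `zk://host:2191`
    keep their own colons.
    """
    config = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition(":")
        if not sep:
            continue  # not a key/value line
        config[key.strip()] = value.strip()
    return config


def summarize(config):
    """Derive a few per-container numbers from the parsed settings."""
    slots = int(config["taskmanager.numberOfTaskSlots"])
    cpus = float(config["mesos.resourcemanager.tasks.cpus"])
    container_mem_mb = int(config["mesos.resourcemanager.tasks.mem"])
    heap_mb = int(config["taskmanager.heap.mb"])
    return {
        "slots": slots,
        "cpus_per_slot": cpus / slots,
        # Assumed sanity check: the heap should not exceed the container size.
        "heap_fits_container": heap_mb <= container_mem_mb,
    }


if __name__ == "__main__":
    print(summarize(parse_flat_config(SAMPLE)))
```

A check like this only confirms that the numbers are self-consistent; it cannot explain why no slot gets registered, which is what the logs are needed for.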
> > >
> > > >
> > >
> > > > On Wed, Mar 21, 2018 at 10:42 PM Till Rohrmann <trohrm...@apache.org>
> > > > wrote:
> > >
> > > >
> > >
> > > > > The resources consumed by the JobMaster can be specified by
> > >
> > > > > `jobmanager.heap.mb`.
> > >
> > > > >
> > >
> > > > > Cheers,
> > >
> > > > > Till
> > >
> > > > >
> > >
> > > > > On Wed, Mar 21, 2018 at 3:20 PM, Renjie Liu <liurenjie2...@gmail.com>
> > > > > wrote:
> > >
> > > > >
> > >
> > > > > > Hi, Till:
> > >
> > > > > >
> > >
> > > > > > In fact, I wanted to ask about the resources consumed by the job manager.
> > >
> > > > > >
> > >
> > > > > > On Wed, Mar 21, 2018 at 8:17 PM, Till Rohrmann <trohrm...@apache.org> wrote:
> > >
> > > > > >
> > >
> > > > > > > As many as the application needs to run. If you start a job with
> > > > > > > parallelism 10 then it will ask for 10 slots (assuming slot sharing).
> > >
> > > > > > >
> > >
> > > > > > > On Wed, Mar 21, 2018 at 12:04 PM, Renjie Liu <liurenjie2...@gmail.com>
> > > > > > > wrote:
> > >
> > > > > > >
> > >
> > > > > > > > So how many slots may a job manager consume?
> > >
> > > > > > > >
> > >
> > > > > > > > On Wed, Mar 21, 2018 at 6:50 PM Till Rohrmann <trohrm...@apache.org>
> > > > > > > > wrote:
> > >
> > > > > > > >
> > >
> > > > > > > > > At the moment this is not possible. In order to do this, you will have to
> > > > > > > > > use the per-job mode and run each job on a dedicated Flink cluster.
> > >
> > > > > > > > >
> > >
> > > > > > > > > On Wed, Mar 21, 2018 at 11:33 AM, Renjie Liu <liurenjie2...@gmail.com>
> > > > > > > > > wrote:
> > >
> > > > > > > > >
> > >
> > > > > > > > > > For example, we have 2 jobs.
> > > > > > > > > > For job 1, I want to start the job manager with 1 CPU and 100M memory.
> > > > > > > > > > Job 1 needs 10 slots, and I want to deploy these 10 slots in 2 task
> > > > > > > > > > managers, each with 5 cores and 1G memory.
> > > > > > > > > >
> > > > > > > > > > For job 2, I want to start the job manager with 2 CPUs and 200M memory.
> > > > > > > > > > Job 2 needs 100 slots, and I want to deploy these 100 slots in 10 task
> > > > > > > > > > managers, each with 10 cores and 2G memory.
> > >
> > > > > > > > > >
> > >
> > > > > > > > > > Is this possible?
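The request above boils down to simple arithmetic: slots per task manager is the total slot count divided by the number of task managers. A minimal sketch in Python, where the `JobRequest` class and its field names are hypothetical and only restate the numbers from the example:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JobRequest:
    """Hypothetical container for the per-job numbers in the example."""
    name: str
    total_slots: int
    task_managers: int
    cores_per_task_manager: int

    def slots_per_task_manager(self) -> int:
        # Slots must divide evenly for identical task managers.
        if self.total_slots % self.task_managers != 0:
            raise ValueError("slots do not divide evenly across task managers")
        return self.total_slots // self.task_managers

    def slots_per_core(self) -> float:
        return self.slots_per_task_manager() / self.cores_per_task_manager


REQUESTS = [
    JobRequest("job 1", total_slots=10, task_managers=2, cores_per_task_manager=5),
    JobRequest("job 2", total_slots=100, task_managers=10, cores_per_task_manager=10),
]

if __name__ == "__main__":
    for request in REQUESTS:
        print(request.name, request.slots_per_task_manager(), request.slots_per_core())
```

Both jobs work out to one slot per core, but they need differently sized task managers (5 slots versus 10 slots each), which is why a single shared configuration cannot serve both.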
> > >
> > > > > > > > > >
> > >
> > > > > > > > > > On Wed, Mar 21, 2018 at 6:19 PM Till Rohrmann <trohrm...@apache.org>
> > > > > > > > > > wrote:
> > >
> > > > > > > > > >
> > >
> > > > > > > > > > > Hi Renjie,
> > >
> > > > > > > > > > >
> > >
> > > > > > > > > > > what exactly do you mean by specifying different JM and TM resources for
> > > > > > > > > > > different jobs?
> > >
> > > > > > > > > > >
> > >
> > > > > > > > > > > Cheers,
> > >
> > > > > > > > > > > Till
> > >
> > > > > > > > > > >
> > >
> > > > > > > > > > > On Wed, Mar 21, 2018 at 10:55 AM, Renjie Liu <liurenjie2...@gmail.com>
> > > > > > > > > > > wrote:
> > >
> > > > > > > > > > >
> > >
> > > > > > > > > > > > Hi, Till:
> > >
> > > > > > > > > > > >
> > >
> > > > > > > > > > > > How do I specify job manager and task manager resources for different
> > > > > > > > > > > > jobs in session mode?
> > >
> > > > > > > > > > > >
> > >
> > > > > > > > > > > > On Sun, Mar 18, 2018 at 1:10 AM Till Rohrmann <trohrm...@apache.org>
> > > > > > > > > > > > wrote:
> > >
> > > > > > > > > > > >
> > >
> > > > > > > > > > > > > Hi Shuyi,
> > >
> > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > it is best to look at the other e2e tests in the flink-end-to-end-tests
> > > > > > > > > > > > > module, for example the Kafka e2e test under
> > > > > > > > > > > > > flink/flink-end-to-end-tests/test-scripts/test_streaming_kafka010.sh.
> > >
> > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > Cheers,
> > >
> > > > > > > > > > > > > Till
> > >
> > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > On Fri, Mar 16, 2018 at 10:20 PM, Shuyi Chen <suez1...@gmail.com>
> > > > > > > > > > > > > wrote:
> > >
> > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > Hi Till,
> > >
> > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > For FLINK-8562, the test is passing now because it's not really
> > > > > > > > > > > > > > checking the right thing.
> > >
> > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > Yes, I can help with the Kerberos integration ticket.
> > >
> > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > Is there an example of how the e2e test should be structured and
> > > > > > > > > > > > > > invoked?
> > >
> > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > Thanks
> > >
> > > > > > > > > > > > > > Shuyi
> > >
> > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > On Fri, Mar 16, 2018 at 6:51 AM, Till Rohrmann <trohrm...@apache.org>
> > > > > > > > > > > > > > wrote:
> > >
> > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > Hi Shuyi,
> > >
> > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > thanks for working on FLINK-8562. Once this issue is fixed, it will
> > > > > > > > > > > > > > > automatically be executed on the Flip-6 components. In fact, it is
> > > > > > > > > > > > > > > already being executed on Flip-6.
> > >
> > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > But what you could help the community with is setting up an automated
> > > > > > > > > > > > > > > end-to-end test for the Kerberos integration if you want:
> > > > > > > > > > > > > > > https://issues.apache.org/jira/browse/FLINK-8981.
> > >
> > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > The Flink community is currently working on automating more and more
> > > > > > > > > > > > > > > tests in order to facilitate faster releases and improve the test
> > > > > > > > > > > > > > > coverage. You can find more about this effort here:
> > > > > > > > > > > > > > > https://issues.apache.org/jira/browse/FLINK-8970.
> > >
> > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > Cheers,
> > >
> > > > > > > > > > > > > > > Till
> > >
> > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > On Thu, Mar 15, 2018 at 8:45 PM, Shuyi Chen <suez1...@gmail.com>
> > > > > > > > > > > > > > > wrote:
> > >
> > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > Hi Till,
> > >
> > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > This is Shuyi :) Thanks a lot. For FLINK-8562, I already sent a PR to
> > > > > > > > > > > > > > > > resolve the issue; it would be great if you could take a look.
> > >
> > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > Please let me know how I can help test the Kerberos authentication; I
> > > > > > > > > > > > > > > > am decently familiar with the Kerberos and YARN security parts of
> > > > > > > > > > > > > > > > Flink.
> > >
> > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > As a starting point, I'd suggest adding an integration test similar to
> > > > > > > > > > > > > > > > YARNSessionFIFOSecuredITCase for Flip-6.
> > >
> > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > Shuyi
> > >
> > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > On Thu, Mar 15, 2018 at 5:44 AM, Till Rohrmann <trohrm...@apache.org>
> > > > > > > > > > > > > > > > wrote:
> > >
> > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > > Hi Renjie,
> > >
> > > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > > thanks for the pointer to YARNSessionFIFOSecuredITCase. You're right
> > > > > > > > > > > > > > > > > that we should fix this test. There is FLINK-8562, which seems to
> > > > > > > > > > > > > > > > > address the problem. Will take a look.
> > >
> > > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > > Additionally, we want to test Kerberos authentication explicitly as
> > > > > > > > > > > > > > > > > part of the release testing for Flink 1.5. I will shortly send around
> > > > > > > > > > > > > > > > > a mail where I will lay out the ongoing testing efforts and where
> > > > > > > > > > > > > > > > > more is needed.
> > >
> > > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > > Cheers,
> > >
> > > > > > > > > > > > > > > > > Till
> > >
> > > > > > > > > > > > > > > > >
> > >
> > > > > > > > > > > > > > > > > On Thu, Mar 15, 2018 at 7:37 AM, Renjie
> Liu <
> > >
> > > > > > > > > > > > > liurenjie2...@gmail.com
> > >
> > > > > > > > > > > > > > >
> > >
> > > > > > > > > > &
> >
> > --
> > Liu, Renjie
> > Software Engineer, MVAD
> >
>
-- 
Liu, Renjie
Software Engineer, MVAD
