Hi Robert and Max, thanks for giving us some feedback to ponder on. We'll start by opening the issue and some sub-tasks to keep track of the prototyping and and design phase, along with development, testing and documentation.
Robert, I thought of the encryption issue and was one of the things I hoped would have come up in this thread; it's good to know it's already on the TODO list; we're currently studying this issue and perhaps we can collaborate on this or even develop it entirely. On Thu, Mar 24, 2016 at 10:38 AM, Robert Metzger <rmetz...@apache.org> wrote: > Hi Stefano, > > I think the proposed feature is not limited to YARN sessions. With the code > in place, also standalone clusters would allow us to authenticate file > system access with the user who submitted the job. > > I would recommend you to do some prototyping and come up with a design > document first. The change has quite some implications. > Some things that come into my mind: > - Filesystem implementations are currently instantiated once. I think we > would need to securely instantiate filesystems per user (imagine multiple > users on Flink) (That's why the YARN session user owns the file system / > Hbase access) > - There is currently no over-the-write encryption (its on my TODO list) in > Flink, so how do you transfer the security tokens? (YARN is currently doing > that for us. so we don't need to worry about that) > > There are probably more implications you'll find while implementing this. > > > On Wed, Mar 23, 2016 at 7:06 PM, Maximilian Michels <m...@apache.org> > wrote: > > > Hi Stefano, > > > > Sounds great. Please go ahead! Note that Flink already provides the > > proposed feature for per-job Yarn clusters. However, it is a valuable > > addition to realize this feature for the Yarn session. > > > > The only blocker that I can think of is probably this PR which changes > > a lot of the Yarn classes: https://github.com/apache/flink/pull/1741 > > There are also changes planned for the client side to decouple the > > Yarn support from the job submission process and make it easier to > > integrate other frameworks (like Mesos). I don't think that will block > > your contribution since a lot of the logic is probably going to be > > contained in separate classes which can be integrated even when code > > changes. Let's just stay in sync. > > > > If you like, you could start off by opening an issue and submitting a > > short design document. > > > > Cheers, > > Max > > > > On Wed, Mar 23, 2016 at 3:55 PM, Stefano Baghino > > <stefano.bagh...@radicalbit.io> wrote: > > > Hello everybody, > > > > > > some of us at Radicalbit spent the last few weeks experimenting to > > improve > > > the understanding of the compatibility of Flink with secure cluster > > > environments and with Kerberos in particular. > > > > > > We’ve found a possible area of improvement and would like to work on it > > as > > > part of our effort to contribute to Flink in the open: after a few > tests > > > and a short exchange on the user mailing list > > > < > > > http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Kerberos-on-YARN-delegation-or-proxying-tp5315p5318.html > > > > > > we’ve come to realize that currently a long-running session on YARN > acts > > on > > > behalf of the user that originally ran the session, not the one who’s > > > submitting the job. > > > > > > We think it would be a nice improvement to be able to have a single > Flink > > > session that keeps running with several users submitting their jobs > with > > > their own credentials. > > > > > > We’d like to develop, test and document this improvement. > > > > > > Do you think this is feasible? Are there any blockers we should be > aware > > of > > > before undertaking this task? Would this be something of interest for > the > > > community? Are there any other ongoing efforts that aim toward this? > > > > > > We’d love to have the feedback of the community on this, thank you in > > > advance to anyone who’s willing to share their insight and opinion with > > us. > > > > > > -- > > > BR, > > > Stefano Baghino > > > > > > Software Engineer @ Radicalbit > > > -- BR, Stefano Baghino Software Engineer @ Radicalbit