Re: Kerberos on YARN: delegation or proxying?

2016-03-06 Thread Stefano Baghino
Ok, thank you for the very detailed explanation! On Sun, Mar 6, 2016 at 10:02 PM, Maximilian Michels wrote: > Hi Stefano, > > That is currently a limitation of the Kerberos implementation. The > Kerberos authentication is performed only once the Flink cluster is > brought up. The Yarn session is

Re: Kerberos on YARN: delegation or proxying?

2016-03-06 Thread Maximilian Michels
Hi Stefano, That is currently a limitation of the Kerberos implementation. The Kerberos authentication is performed only once the Flink cluster is brought up. The Yarn session is then tight to a particular user's ticket. Note, that you need at least Hadoop version 2.6.1 or higher to run long-runni

Re: Kerberos on YARN: delegation or proxying?

2016-03-06 Thread Stefano Baghino
One last note: initially I tried to run the session as the same OS user, running kdestroy and then kinit with the other user, having this error. Trying to run the job in a different OS session, authenticating with Kerberos as the user who should run the job, I can't connect to the JobManager. I've

Re: Kerberos on YARN: delegation or proxying?

2016-03-06 Thread Stefano Baghino
In the initial description, I meant "I'm trying to access a private folder of the latter", so not the service account. Sorry for the mistake. On Sun, Mar 6, 2016 at 8:54 PM, Stefano Baghino < stefano.bagh...@radicalbit.io> wrote: > Hello everybody, > > I'm running some tests on how Flink as a lon

Kerberos on YARN: delegation or proxying?

2016-03-06 Thread Stefano Baghino
Hello everybody, I'm running some tests on how Flink as a long-running YARN session handles security with Kerberos. In particular, I'm running a test where I run Flink on YARN with a service account and then deploy a job via CLI as another user; in the job I'm trying to access a private folder of

Re: SourceFunction Scala

2016-03-06 Thread Márton Balassi
Hey Ankur, Add the following line to your imports, and have a look at the referenced FAQ. [1] import org.apache.flink.streaming.api.scala._ [1] https://flink.apache.org/faq.html#in-scala-api-i-get-an-error-about-implicit-values-and-evidence-parameters Best, Marton On Sun, Mar 6, 2016 at 8:17

SourceFunction Scala

2016-03-06 Thread Ankur Sharma
Hello, I am trying to use a custom source function (declaration given below) for DataStream. if I add the source to stream using add source: val stream = env.addSource(new QueryOneSource(args)) I get following error: Any explanations and help ?? Error:(14, 31) could not find implicit value fo

Re: Retrieve elements from the Dataset without using collect

2016-03-06 Thread subash basnet
Hello Konstantin, Yup thanks. Best Regards, Subash Basnet On Sun, Mar 6, 2016 at 7:20 PM, Konstantin Knauf < konstantin.kn...@tngtech.com> wrote: > [image: Boxbe] This message is eligible > for Automatic Cleanup! (konstantin.kn...@tngtech.com) Add cleanup rule

Re: Retrieve elements from the Dataset without using collect

2016-03-06 Thread Konstantin Knauf
Hi Subash, I think Dataset.first(int n) is what you are looking for. Cheers, Konstantin On 06.03.2016 19:10, subash basnet wrote: > Hello all, > > My requirement is to get suppose top '10' elements from the DataSet as > another DataSet. How would I do that without using collect. > Eg: > *Dat

Retrieve elements from the Dataset without using collect

2016-03-06 Thread subash basnet
Hello all, My requirement is to get suppose top '10' elements from the DataSet as another DataSet. How would I do that without using collect. Eg: *DataSet> counts =* *data.flatMap(new Tokenizer());* I want a new DataSet containing 10 elements of *counts*. And, what would be the way to retrieve i