[DISCUSS] Secure Flink clusters

2016-05-11 Thread Wright, Eron
Hello! There have been a few discussions lately on how to improve the Kerberos support in Flink. I’ve drafted a design document that lays out a plan to support keytab-based authentication for HDFS, Kafka, and ZooKeeper. In addition, the plan contemplates secure, TLS-based communication between c

RequiredParameters vs ParameterTool

2016-05-11 Thread Flavio Pompermaier
Hi flinkers, I was converting my parameter validation code to the new (and very useful) RequiredParameters APIs. However, I ran into some issues with its semantics, and I wanted to report my doubts to the community and see whether those APIs could be improved or not. I started using RequiredParameters t

Re: [VOTE] Release Apache Flink 1.0.3 (RC3)

2016-05-11 Thread Ufuk Celebi
+1 - Verified signatures and hashes - Built from source (mvn clean install) - Checked the Java API quickstart - Verified checkpoint stats tracker configuration fix (FLINK-3803) - Verified no new dependencies On Tue, May 10, 2016 at 12:36 PM, Fabian Hueske wrote: > Forwarding my vote from RC2. >

[jira] [Created] (FLINK-3898) Adamic-Adar Similarity

2016-05-11 Thread Greg Hogan (JIRA)
Greg Hogan created FLINK-3898: - Summary: Adamic-Adar Similarity Key: FLINK-3898 URL: https://issues.apache.org/jira/browse/FLINK-3898 Project: Flink Issue Type: New Feature Components:

[RESULT] [VOTE] Release Apache Flink 1.0.3 (RC3)

2016-05-11 Thread Ufuk Celebi
This vote has passed with 3 binding +1 votes. Thanks to everyone who contributed and tested the release candidate. +1s: Gyula Fora (binding) Fabian Hueske (binding) Ufuk Celebi (binding) There are no 0s or -1s. I'll go ahead and finalize and package this release. On Mon, May 9, 2016 at 10:24 AM, Uf

Re: [RESULT] [VOTE] Release Apache Flink 1.0.3 (RC3)

2016-05-11 Thread Fabian Hueske
Thanks Ufuk! 2016-05-11 16:39 GMT+02:00 Ufuk Celebi : > This vote has passed with 3 binding +1 votes. Thanks to everyone who > contributed and tested the release candidate. > > +1s: > Gyula Fora (binding) > Fabian Hueske (binding) > Ufuk Celebi (binding) > > There are no 0s or -1s. > > I'll go ah

Re: [RESULT] [VOTE] Release Apache Flink 1.0.3 (RC3)

2016-05-11 Thread Stephan Ewen
Thanks for pushing this release Ufuk! On Wed, May 11, 2016 at 5:12 PM, Fabian Hueske wrote: > Thanks Ufuk! > > 2016-05-11 16:39 GMT+02:00 Ufuk Celebi : > > > This vote has passed with 3 binding +1 votes. Thanks to everyone who > > contributed and tested the release candidate. > > > > +1s: > > Gy

[QUESTION] thread model in Flink makes me confused

2016-05-11 Thread Jark Wu
As far as I know, Flink uses a thread model, which means one TaskManager process may run many different operator threads from different jobs. So tasks from different jobs will compete for memory and CPU within that one process. In the worst case, a bad job will eat most of the CPU and memory, which may 

Re: RequiredParameters vs ParameterTool

2016-05-11 Thread Aljoscha Krettek
I think RequiredParameters is meant to be used with ParameterTool. For example, check out RequiredParametersTest. On Wed, 11 May 2016 at 11:05 Flavio Pompermaier wrote: > Hi flinkers, > I was converting my parameter validation code to the new (and very useful) > RequiredParameters APIs. However

Re: RequiredParameters vs ParameterTool

2016-05-11 Thread Flavio Pompermaier
I know how it works, I just wanted to point out that that part could be improved a little bit, since there are redundant functions and maybe the name RequiredParameters is somewhat misleading...maybe ParameterToolHelper or ParameterToolUtils would be better... On 11 May 2016 19:00, "Aljoscha

How to specify dependencies for an application that needs to use modified version of Flink

2016-05-11 Thread Saiph Kappa
Hi, I'm making some modifications to Flink (current trunk version). I want a Scala app (sbt-based) to use that modified version. I'm only modifying the flink-streaming-java module. What is the typical way to specify the dependencies for my application in this case? Should I copy all jars to th
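One possible setup for the question above, offered as a sketch rather than an established recommendation: install the modified Flink build into the local Maven repository (for example with `mvn clean install -DskipTests` in the Flink checkout) and let sbt resolve the artifacts from there. The version string `1.1-SNAPSHOT`, the Scala version, and the project name below are assumptions; use whatever the modified checkout's pom.xml declares.

```scala
// build.sbt (sketch; version numbers and project name are assumptions)
name := "my-flink-app" // hypothetical project name

scalaVersion := "2.10.6" // assumed; must match the Scala version Flink was built with

// Also resolve from the local Maven repository (~/.m2/repository),
// where `mvn install` places the modified Flink artifacts
resolvers += Resolver.mavenLocal

val flinkVersion = "1.1-SNAPSHOT" // assumed version of the modified trunk build

libraryDependencies ++= Seq(
  // flink-streaming-scala pulls in flink-streaming-java transitively
  "org.apache.flink" %% "flink-streaming-scala" % flinkVersion,
  "org.apache.flink" %% "flink-clients"         % flinkVersion
)
```

With this layout, rebuilding and reinstalling only the changed module should be enough for sbt to pick up the new snapshot on the next update.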

Re: [QUESTION] thread model in Flink makes me confused

2016-05-11 Thread Wright, Eron
One option is to use a separate cluster (JobManager + TaskManagers) for each job. This is fairly straightforward with the YARN support - "flink run" can launch a cluster for a job and tear it down afterwards. Of course this means you must deploy YARN. That doesn’t necessarily imply HDFS tho

Re: How to specify dependencies for an application that needs to use modified version of Flink

2016-05-11 Thread Deepak Jha
Hi Saiph, I'm using a combination of both: a fat jar for the application-related jars, while keeping the Flink-specific jars (e.g. flink-scala, flink-clients, flink-connector) inside the lib. On Wed, May 11, 2016 at 11:33 AM, Saiph Kappa wrote: > Hi, > > I'm performing some modifications on Flink (current tr

Re: [QUESTION] thread model in Flink makes me confused

2016-05-11 Thread Aljoscha Krettek
I favor the one-cluster-per-job approach. If this becomes the dominant approach to doing things, we could also think about introducing a separate component that would allow monitoring the jobs in these per-job clusters, as is now possible when running multiple jobs in a single cluster. On Thu, 12 Ma