RE: Flink performance tuning

2016-05-17 Thread Serhiy Boychenko
sent your data in Flink? (Are you using the TupleX types or POJOs?) How do you select the key for the grouping? Regards, Robert On Fri, May 13, 2016 at 11:25 AM, Serhiy Boychenko <serhiy.boyche...@cern.ch> wrote: Hey, I have successfully integrated Flink into our very small test c

Flink performance tuning

2016-05-13 Thread Serhiy Boychenko
Hey, I have successfully integrated Flink into our very small test cluster (3 machines with 8 cores, 8 GB of memory and 2x1TB disks). Basically, I started the session to use YARN as RM, and the data is being read from HDFS. /yarn-session.sh -n 21 -s 1 -jm 1024 -tm 1024 My code is very simpl

RE: Running Flink jobs directly from Eclipse

2016-04-06 Thread Serhiy Boychenko
se: ExecutionEnvironment env = ExecutionEnvironment.createRemoteEnvironment("xx.xx.xx.xx", 40631, "target/FlinkTest-0.0.1-SNAPSHOT-jar-with-dependencies.jar"); The JAR needs to be built beforehand. Hope it helps! Christophe 2016-04-06 9:25 GMT+02:00 Serhiy Boychenko <serhiy.boyche...@cern.ch>: Cheerz, I

Running Flink jobs directly from Eclipse

2016-04-06 Thread Serhiy Boychenko
Cheerz, I have been working for the last few months on the comparison of different data processing engines and recently came across Apache Flink. After reading different academic papers comparing Flink with other data processing engines, I would definitely give it a shot. The only issue I am currently hav