Re: flink ml - k-means

2015-04-26 Thread Alexander Alexandrov
Yes, I expect to have one in the next few weeks (the code is actually there, but we need to port it to the Flink ML API). I suggest following the JIRA issue over the next few weeks to check when this is done: https://issues.apache.org/jira/browse/FLINK-1731 Regards, Alexander PS. Bear in mind that we

Re: Flink - Avro - AvroTypeInfo issue - Index out of bounds exception

2015-04-26 Thread Filip Łęczycki
Hi Stephan, You are right, sorry for not including this in the initial mail. I am receiving the information below: 04/26/2015 17:13:43 DataSink (Print to System.out)(1/1) switched to FINISHED 04/26/2015 17:13:43 CHAIN DataSource (at org.apache.flink.api.scala.ExecutionEnvironment.createInput(ExecutionEn

how can handle left outer join for any two dataset

2015-04-26 Thread hagersaleh
How can I handle a left outer join for any two datasets? The datasets can include any number of fields. Example, data set one: ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment(); DataSet> customer=env.readCsvFile("/home/hadoop/Desktop/Dataset/customer.csv") .fieldDelimiter('|') .includeFiel

Re: Difference between using a global variable and broadcasting a variable

2015-04-26 Thread Sebastian
Hi Hung, A broadcast variable can also refer to an intermediate result of a Flink computation. Best, Sebastian On 25.04.2015 21:10, HungChang wrote: Hi, What would be the difference between using a global variable and broadcasting it? A toy example: // Using global {{... private static int