Yes, I expect to have one in the next few weeks (the code is actually there, but we need to port it to the Flink ML API). I suggest to follow the JIRA issue in the next weeks to check when this is done:
https://issues.apache.org/jira/browse/FLINK-1731 Regards, Alexander PS. Bear in mind that we will start with a vanilla implementation of K-Means. For a thorough evaluation you might want to also check variants like K-Means++. 2015-04-24 15:08 GMT+02:00 Pa Rö <paul.roewer1...@googlemail.com>: > hi flink community, > > at the time I write my master thesis in the field machine learning. My > main task is to evaluated different k-means variants for large data sets > (BigData). I would like test flink ml against Apache Mahout and Apache > Hadoop MapReduce in areas of scalability and performance(time and space). > What is the current state for the purpose of clustering, especially > K-Means? Will there be in the near future a release information this? > > best greetings > paul >