Ziyad Muhammed Mohiyudheen created FLINK-5888:
-------------------------------------------------

             Summary: ForwardedFields annotation is not generating optimised 
execution plan in example KMeans job
                 Key: FLINK-5888
                 URL: https://issues.apache.org/jira/browse/FLINK-5888
             Project: Flink
          Issue Type: Bug
          Components: DataSet API, Examples, Java API
    Affects Versions: 1.1.3
            Reporter: Ziyad Muhammed Mohiyudheen


The Flink KMeans Java example [1] demonstrates the ForwardedFields function 
annotation. However, the example job took longer than expected, even on 
medium-sized data. Merely removing the function annotations from the example 
code (without any other change) yielded a better execution plan and a shorter 
run time. With ForwardedFields enabled, the execution plan shows that no 
combiner is used and that the two Map tasks are not chained. The experiment is 
documented in [2].

[1] 
https://github.com/apache/flink/blob/master/flink-examples/flink-examples-batch/src/main/java/org/apache/flink/examples/java/clustering/KMeans.java
[2] https://drive.google.com/open?id=0B0IlZv0uHBuvVEZ5ZmNpN19jVVU
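
For readers unfamiliar with the annotation: a forwarded-field declaration 
tells the optimizer that a function copies an input field unchanged to a given 
output position. The sketch below is plain Java and does not use any Flink 
API. The class ForwardedFieldCheck and its methods forwards, appendCount and 
shiftId are hypothetical names introduced only for illustration; appendCount 
merely mimics the shape of a map that appends a count to an (id, point) tuple. 
It shows one way to sanity-check on sample data that such a declaration would 
be truthful, which is separate from the plan regression reported here.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Objects;
import java.util.function.Function;

// Hypothetical, Flink-free model of what a forwarded-field declaration asserts.
public class ForwardedFieldCheck {

    // Returns true if fn copies input[srcPos] unchanged to output[dstPos]
    // for every sample tuple (tuples are modelled as Object arrays).
    public static boolean forwards(Function<Object[], Object[]> fn,
                                   int srcPos, int dstPos,
                                   List<Object[]> samples) {
        for (Object[] in : samples) {
            Object[] out = fn.apply(in);
            if (dstPos >= out.length || !Objects.equals(in[srcPos], out[dstPos])) {
                return false;
            }
        }
        return true;
    }

    // Assumed shape of a count-appending map: (id, point) -> (id, point, 1L).
    // Field 0 is passed through untouched.
    public static Object[] appendCount(Object[] t) {
        return new Object[] { t[0], t[1], 1L };
    }

    // Counter-example: rewrites the id, so declaring field 0 as forwarded
    // would be wrong for this function.
    public static Object[] shiftId(Object[] t) {
        return new Object[] { (Integer) t[0] + 1, t[1], 1L };
    }

    public static void main(String[] args) {
        List<Object[]> samples = new ArrayList<>();
        samples.add(new Object[] { 1, "p1" });
        samples.add(new Object[] { 2, "p2" });

        System.out.println(forwards(ForwardedFieldCheck::appendCount, 0, 0, samples)); // true
        System.out.println(forwards(ForwardedFieldCheck::shiftId, 0, 0, samples));     // false
    }
}
```

A check like this can only rule out an incorrect declaration on the samples 
tried; it says nothing about why the optimizer picks a different plan when the 
annotation is present.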




--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
