[jira] [Created] (FLINK-1476) Flink VS Spark on loop test

xuhong (JIRA) Wed, 04 Feb 2015 19:23:28 -0800

xuhong created FLINK-1476:
-----------------------------

             Summary: Flink VS Spark on loop test
                 Key: FLINK-1476
                 URL: https://issues.apache.org/jira/browse/FLINK-1476
             Project: Flink
          Issue Type: Test
    Affects Versions: 0.8, 0.7.0-incubating
         Environment: 3 machines, every machines has 24 CPU cores and allocate 
16 CPU cores for the tests. The memory situation is: 3 * 32G


            Reporter: xuhong
            Priority: Critical


    In the last days, i did some test on flink and spark. The test results 
shows that flink can do better on many operations, such as GroupBy, Join and 
some complex jobs. But when I do the KMeans, LinearRegression and other loop 
tests, i found that flink is no more excellent than spark. I want to konw, 
whether flink is more comfortable to do the loop jobs with spark.
    I add code: env.setDegreeOfParallelism(16) in each test to allocate same 
CPU cores as in Spark tests.
    My english is not good, i wish you guys can understand me!

the following is some config of my Flnk:

jobmanager.rpc.port: 6123
jobmanager.heap.mb: 2048
taskmanager.heap.mb: 2048
taskmanager.numberOfTaskSlots: 24
parallelization.degree.default: 72

jobmanager.web.port: 8081
webclient.port: 8085

fs.overwrite-files: true
taskmanager.memory.fraction: 0.8
taskmanager.network.numberofBuffers: 70000



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (FLINK-1476) Flink VS Spark on loop test

Reply via email to