Hi Vinaya

1. Comparing streaming tools ( in this case Storm and Flink) should not be
based on performance benchmarks only! For example, slides 16-36 list over 96
criteria, that we identified at Capital One, to compare two streaming tools   
http://www.slideshare.net/sbaltagi/flink-vs-spark/17

2. Now, if you are focusing on performance only, I'll suggest a few related
resources: 

- Benchmarking Streaming Computation Engines at Yahoo!  
http://yahooeng.tumblr.com/post/135321837876/benchmarking-streaming-computation-engines-at
 
December 16, 2015 Code at github:
https://github.com/yahoo/streaming-benchmarks

-  There is some work started by some Flink contributors to create some
performance scripts for Flink, Spark, and MapReduce here: There is Apache
Flink: Performance and Testing  https://github.com/project-flink/flink-perf

- Some first numbers on performance of streaming jobs with Apache Flink are
here:
http://data-artisans.com/high-throughput-low-latency-and-exactly-once-stream-processing-with-apache-flink/
 
under the section: 'Show me the numbers'. Code used is at:
https://github.com/dataArtisans/performance  

- Yangjun Wang is currently working on his Master thesis at Aalto university
in Helsinki, Finland. The topic of his thesis is about building a standard
benchmark system for streaming processing systems like Apache Storm, Spark
and Flink. Code at github
https://github.com/wangyangjun/StreamBench/tree/master/StreamBench

3. I am giving a talk in NYC on Tuesday February 2nd, 2016 on Apache Flink
and I will be touching a bit on benchmarks
http://www.meetup.com/New-York-City-NYC-Apache-Flink-Meetup/events/228113118/
You are welcome to attend. 

Thanks

Slim Baltagi 



--
View this message in context: 
http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Comparison-of-storm-and-flink-tp4468p4469.html
Sent from the Apache Flink User Mailing List archive. mailing list archive at 
Nabble.com.

Reply via email to