Stephan Ewen created FLINK-2586: ----------------------------------- Summary: Unstable Storm Compatibility Tests Key: FLINK-2586 URL: https://issues.apache.org/jira/browse/FLINK-2586 Project: Flink Issue Type: Bug Components: Storm Compatibility Affects Versions: 0.10 Reporter: Stephan Ewen Priority: Critical Fix For: 0.10
The Storm Compatibility tests frequently fail. The reason is that they kill the topologies after a certain time interval. That may fail on CI infrastructure when certain steps are delayed beyond usual. Trying to guarantee progress by time is inherently problematic: - Waiting too short makes tests unstable - Waiting too long makes tests slow The right way to go is letting the program decide when to terminate, for example by throwing a special {{SuccessException}}. Have a look at the Kafka connector tests, they do this a lot and hence run exactly as short or as long as they need to. Here is an example of a failed run: https://s3.amazonaws.com/archive.travis-ci.org/jobs/77499577/log.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)