[ 
https://issues.apache.org/jira/browse/FLINK-17825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17118335#comment-17118335
 ] 

Robert Metzger commented on FLINK-17825:
----------------------------------------

Another case in the Scala 2.12 profile 
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=2305&view=logs&j=08866332-78f7-59e4-4f7e-49a56faa3179&t=931b3127-d6ee-5f94-e204-48d51cd1c334
{code}
2020-05-27T21:32:55.3839807Z Starting standalonesession daemon on host fv-az558.
2020-05-27T21:32:56.8802120Z grep: 
/home/vsts/work/1/s/flink-dist/target/flink-1.11-SNAPSHOT-bin/flink-1.11-SNAPSHOT/log/*standalonesession-2*.log:
 No such file or directory
2020-05-27T21:33:37.5553176Z Killed JM @ 94046
2020-05-27T21:33:37.5556291Z Waiting for text Completed checkpoint [1-9]* for 
job 2f93c52fd325fa7410fb665dee80bb20 to appear 2 of times in logs...
2020-05-27T21:33:37.5956808Z grep: 
/home/vsts/work/1/s/flink-dist/target/flink-1.11-SNAPSHOT-bin/flink-1.11-SNAPSHOT/log/*standalonesession-3*.log:
 No such file or directory
2020-05-27T21:33:38.6046010Z grep: 
/home/vsts/work/1/s/flink-dist/target/flink-1.11-SNAPSHOT-bin/flink-1.11-SNAPSHOT/log/*standalonesession-3*.log:
 No such file or directory
2020-05-27T21:33:39.5862716Z Starting standalonesession daemon on host fv-az558.
2020-05-27T21:33:39.6089277Z grep: 
/home/vsts/work/1/s/flink-dist/target/flink-1.11-SNAPSHOT-bin/flink-1.11-SNAPSHOT/log/*standalonesession-3*.log:
 No such file or directory
2020-05-27T21:34:32.8925011Z Killed TM @ 94007
2020-05-27T21:34:45.2218956Z Killed TM @ 98235
2020-05-27T21:39:45.2307885Z Test (pid: 85160) did not finish after 540 seconds.

[...]

[FAIL] 'Running HA (rocks, non-incremental) end-to-end test' failed after 9 
minutes and 0 seconds! Test exited with exit code 1
{code}


> HA end-to-end gets killed due to timeout
> ----------------------------------------
>
>                 Key: FLINK-17825
>                 URL: https://issues.apache.org/jira/browse/FLINK-17825
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Coordination, Tests
>    Affects Versions: 1.12.0
>            Reporter: Robert Metzger
>            Assignee: Robert Metzger
>            Priority: Critical
>              Labels: test-stability
>
> CI (normal profile): 
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=1867&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=1e2bbe5b-4657-50be-1f07-d84bfce5b1f5
> {code}
> 2020-05-19T20:46:50.9034002Z Killed TM @ 104061
> 2020-05-19T20:47:05.8510180Z Killed TM @ 107775
> 2020-05-19T20:47:55.1181475Z Killed TM @ 108337
> 2020-05-19T20:48:16.7907005Z Test (pid: 89099) did not finish after 540 
> seconds.
> 2020-05-19T20:48:16.7907777Z Printing Flink logs and killing it:
> [...]
> 2020-05-19T20:48:19.1016912Z 
> /home/vsts/work/1/s/flink-end-to-end-tests/test-scripts/test_ha_datastream.sh:
>  line 125: 89099 Terminated              ( cmdpid=$BASHPID; ( sleep 
> $TEST_TIMEOUT_SECONDS; echo "Test (pid: $cmdpid) did not finish after 
> $TEST_TIMEOUT_SECONDS seconds."; echo "Printing Flink logs and killing it:"; 
> cat ${FLINK_DIR}/log/*; kill "$cmdpid" ) & watchdog_pid=$!; echo 
> $watchdog_pid > $TEST_DATA_DIR/job_watchdog.pid; run_ha_test 4 
> ${STATE_BACKEND_TYPE} ${STATE_BACKEND_FILE_ASYNC} 
> ${STATE_BACKEND_ROCKS_INCREMENTAL} ${ZOOKEEPER_VERSION} )
> 2020-05-19T20:48:19.1017985Z Stopping job timeout watchdog (with pid=89100)
> 2020-05-19T20:48:19.1018621Z 
> /home/vsts/work/1/s/flink-end-to-end-tests/test-scripts/test_ha_datastream.sh:
>  line 112: kill: (89100) - No such process
> 2020-05-19T20:48:19.1019000Z Killing JM watchdog @ 91127
> 2020-05-19T20:48:19.1019199Z Killing TM watchdog @ 91883
> 2020-05-19T20:48:19.1019424Z [FAIL] Test script contains errors.
> 2020-05-19T20:48:19.1019639Z Checking of logs skipped.
> 2020-05-19T20:48:19.1019785Z 
> 2020-05-19T20:48:19.1020329Z [FAIL] 'Running HA (rocks, non-incremental) 
> end-to-end test' failed after 9 minutes and 0 seconds! Test exited with exit 
> code 1
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to