[ https://issues.apache.org/jira/browse/FLINK-23210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Flink Jira Bot updated FLINK-23210: ----------------------------------- Labels: stale-major test-stability (was: test-stability) I am the [Flink Jira Bot|https://github.com/apache/flink-jira-bot/] and I help the community manage its development. I see this issues has been marked as Major but is unassigned and neither itself nor its Sub-Tasks have been updated for 60 days. I have gone ahead and added a "stale-major" to the issue". If this ticket is a Major, please either assign yourself or give an update. Afterwards, please remove the label or in 7 days the issue will be deprioritized. > Running HA per-job cluster (hashmap, sync) end-to-end test failed on azure > -------------------------------------------------------------------------- > > Key: FLINK-23210 > URL: https://issues.apache.org/jira/browse/FLINK-23210 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination > Affects Versions: 1.13.1 > Reporter: Dawid Wysakowicz > Priority: Major > Labels: stale-major, test-stability > > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=19776&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=ff888d9b-cd34-53cc-d90f-3e446d355529&l=11631 > {code} > Jul 01 12:54:55 > Jul 01 12:54:55 > ============================================================================== > Jul 01 12:54:55 Running 'Running HA per-job cluster (hashmap, sync) > end-to-end test' > Jul 01 12:54:55 > ============================================================================== > Jul 01 12:54:55 TEST_DATA_DIR: > /home/vsts/work/1/s/flink-end-to-end-tests/test-scripts/temp-test-directory-55217860509 > Jul 01 12:54:55 Flink dist directory: > /home/vsts/work/1/s/flink-dist/target/flink-1.13-SNAPSHOT-bin/flink-1.13-SNAPSHOT > Jul 01 12:54:55 Flink dist directory: > /home/vsts/work/1/s/flink-dist/target/flink-1.13-SNAPSHOT-bin/flink-1.13-SNAPSHOT > Jul 01 12:54:55 Starting zookeeper daemon on host fv-az91-551. > Jul 01 12:54:55 Running on HA mode: parallelism=4, backend=hashmap, > asyncSnapshots=false, incremSnapshots=false and zk=3.5. > Jul 01 12:54:56 Starting standalonejob daemon on host fv-az91-551. > Jul 01 12:54:56 Start 1 more task managers > Jul 01 12:54:58 Starting taskexecutor daemon on host fv-az91-551. > Jul 01 12:55:02 Job (00000000000000000000000000000000) is not yet running. > Jul 01 12:55:07 Job (00000000000000000000000000000000) is running. > Jul 01 12:55:07 Running JM watchdog @ 351161 > Jul 01 12:55:07 Running TM watchdog @ 351162 > Jul 01 12:55:07 Waiting for text Completed checkpoint [1-9]* for job > 00000000000000000000000000000000 to appear 2 of times in logs... > Jul 01 12:55:08 Killed JM @ 350374 > Jul 01 12:55:08 Waiting for text Completed checkpoint [1-9]* for job > 00000000000000000000000000000000 to appear 2 of times in logs... > grep: /home/vsts/work/_temp/debug_files/flink-logs/*standalonejob-1*.log: No > such file or directory > grep: /home/vsts/work/_temp/debug_files/flink-logs/*standalonejob-1*.log: No > such file or directory > Jul 01 12:55:09 Killed TM @ 350614 > grep: /home/vsts/work/_temp/debug_files/flink-logs/*standalonejob-1*.log: No > such file or directory > Jul 01 12:55:10 Starting standalonejob daemon on host fv-az91-551. > Jul 01 12:55:55 Killed TM @ 351701 > Jul 01 12:55:55 Killed JM @ 351836 > Jul 01 12:55:55 Waiting for text Completed checkpoint [1-9]* for job > 00000000000000000000000000000000 to appear 2 of times in logs... > grep: /home/vsts/work/_temp/debug_files/flink-logs/*standalonejob-2*.log: No > such file or directory > grep: /home/vsts/work/_temp/debug_files/flink-logs/*standalonejob-2*.log: No > such file or directory > Jul 01 12:55:57 Starting standalonejob daemon on host fv-az91-551. > grep: /home/vsts/work/_temp/debug_files/flink-logs/*standalonejob-2*.log: No > such file or directory > Jul 01 12:56:44 Killed TM @ 353554 > Jul 01 12:56:56 Killed TM @ 355735 > Jul 01 13:06:00 A timeout occurred waiting for Completed checkpoint [1-9]* > for job 00000000000000000000000000000000 to appear 2 of times in logs. > Jul 01 13:06:00 Stopping job timeout watchdog (with pid=349933) > Jul 01 13:06:00 Killing JM watchdog @ 351161 > Jul 01 13:06:00 Killing TM watchdog @ 351162 > Jul 01 13:06:00 [FAIL] Test script contains errors. > Jul 01 13:06:00 Checking of logs skipped. > Jul 01 13:06:00 > Jul 01 13:06:00 [FAIL] 'Running HA per-job cluster (hashmap, sync) end-to-end > test' failed after 11 minutes and 5 seconds! Test exited with exit code 1 > Jul 01 13:06:00 > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)