[ https://issues.apache.org/jira/browse/FLINK-34643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17828316#comment-17828316 ]
Ryan Skraba commented on FLINK-34643: ------------------------------------- Weird – I collected a lot of build logs yesterday from over the weekend that resemble this error, but apparently my comment didn't get added :/ I'll go back and find those links. In the meantime, [~roman]: we are still seeing failures in the same test that seem very related to this issue. Is it possible that this fix is incomplete and should be reopened, or would you prefer that I raise a new JIRA? * [https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58398&view=logs&j=8fd9202e-fd17-5b26-353c-ac1ff76c8f28&t=ea7cf968-e585-52cb-e0fc-f48de023a7ca&l=8249] {code:java} Mar 19 01:23:06 [not all expected events logged by org.apache.flink.runtime.jobmaster.JobMaster, logged: Mar 19 01:23:06 [Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Initializing job 'Flink Streaming Job' (2ef7e557551a93ef716b6c3ba580bcd6)., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Using restart back off time strategy NoRestartBackoffTimeStrategy for Flink Streaming Job (2ef7e557551a93ef716b6c3ba580bcd6)., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Starting execution of job 'Flink Streaming Job' (2ef7e557551a93ef716b6c3ba580bcd6) under job master id 90514ce7689864236ebeb94380dc474d., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG Message=Trigger heartbeat request., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Connecting to ResourceManager pekko://flink/user/rpc/resourcemanager_1(8eee414f9dea640cb3668826c12e4976), Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Resolved ResourceManager address, beginning registration, Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG Message=Registration at ResourceManager attempt 1 (timeout=100ms), Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG Message=Registration with ResourceManager at pekko://flink/user/rpc/resourcemanager_1 was successful., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=JobManager successfully registered at ResourceManager, leader id: 8eee414f9dea640cb3668826c12e4976., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Stopping the JobMaster for job 'Flink Streaming Job' (2ef7e557551a93ef716b6c3ba580bcd6)., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=INFO Message=Disconnect TaskExecutor 23ae1952-8d6f-476e-b23b-4fad48feec15 because: Stopping JobMaster for job 'Flink Streaming Job' (2ef7e557551a93ef716b6c3ba580bcd6)., Logger=org.apache.flink.runtime.jobmaster.JobMaster Level=DEBUG Message=Close ResourceManager connection 58e840ebb5c16d7fb17f233b9e93cb3c.]] Mar 19 01:23:06 Expecting empty but was: [Checkpoint storage is set to .*, Mar 19 01:23:06 Running initialization on master for job .*, Mar 19 01:23:06 Starting scheduling.*, Mar 19 01:23:06 State backend is set to .*, Mar 19 01:23:06 Successfully created execution graph from job graph .*, Mar 19 01:23:06 Successfully ran initialization on master.*, Mar 19 01:23:06 Triggering a manual checkpoint for job .*., Mar 19 01:23:06 Using failover strategy .*] Mar 19 01:23:06 at org.apache.flink.test.misc.JobIDLoggingITCase.assertJobIDPresent(JobIDLoggingITCase.java:241) Mar 19 01:23:06 at org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(JobIDLoggingITCase.java:170) Mar 19 01:23:06 at java.lang.reflect.Method.invoke(Method.java:498) Mar 19 01:23:06 at java.util.concurrent.RecursiveAction.exec(RecursiveAction.java:189) Mar 19 01:23:06 at java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) Mar 19 01:23:06 at java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056) Mar 19 01:23:06 at java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692) Mar 19 01:23:06 at java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175) {code} > JobIDLoggingITCase failed > ------------------------- > > Key: FLINK-34643 > URL: https://issues.apache.org/jira/browse/FLINK-34643 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination > Affects Versions: 1.20.0 > Reporter: Matthias Pohl > Assignee: Roman Khachatryan > Priority: Major > Labels: pull-request-available, test-stability > Fix For: 1.20.0 > > > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=8fd9202e-fd17-5b26-353c-ac1ff76c8f28&t=ea7cf968-e585-52cb-e0fc-f48de023a7ca&l=7897 > {code} > Mar 09 01:24:23 01:24:23.498 [ERROR] Tests run: 1, Failures: 0, Errors: 1, > Skipped: 0, Time elapsed: 4.209 s <<< FAILURE! -- in > org.apache.flink.test.misc.JobIDLoggingITCase > Mar 09 01:24:23 01:24:23.498 [ERROR] > org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(ClusterClient) > -- Time elapsed: 1.459 s <<< ERROR! > Mar 09 01:24:23 java.lang.IllegalStateException: Too few log events recorded > for org.apache.flink.runtime.jobmaster.JobMaster (12) - this must be a bug in > the test code > Mar 09 01:24:23 at > org.apache.flink.util.Preconditions.checkState(Preconditions.java:215) > Mar 09 01:24:23 at > org.apache.flink.test.misc.JobIDLoggingITCase.assertJobIDPresent(JobIDLoggingITCase.java:148) > Mar 09 01:24:23 at > org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(JobIDLoggingITCase.java:132) > Mar 09 01:24:23 at java.lang.reflect.Method.invoke(Method.java:498) > Mar 09 01:24:23 at > java.util.concurrent.RecursiveAction.exec(RecursiveAction.java:189) > Mar 09 01:24:23 at > java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) > Mar 09 01:24:23 at > java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056) > Mar 09 01:24:23 at > java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692) > Mar 09 01:24:23 at > java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175) > Mar 09 01:24:23 > {code} > The other test failures of this build were also caused by the same test: > * > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=2c3cbe13-dee0-5837-cf47-3053da9a8a78&t=b78d9d30-509a-5cea-1fef-db7abaa325ae&l=8349 > * > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=a596f69e-60d2-5a4b-7d39-dc69e4cdaed3&t=712ade8c-ca16-5b76-3acd-14df33bc1cb1&l=8209 -- This message was sent by Atlassian Jira (v8.20.10#820010)