[ 
https://issues.apache.org/jira/browse/FLINK-12038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16888482#comment-16888482
 ] 

shuai.xu commented on FLINK-12038:
----------------------------------

This failure can be easily re-produced in my local machine. I enabled the logs 
of YARN, and found the reason. You can find the log of unregisterAM in 
jobmanager.log. When the job is finished, it will try to unregisterAM to YARN. 
In fact, it is not necessary to call killApplication, as the whole YARN mini 
cluster will be closed in the tearDown of test case. 

The bellowing is part of logs of job master:

2019-07-16 18:20:34,376 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source 
(1/2) (e13567c7f2d7a389c74f4583a67e34e8) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,376 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Source: 
Custom Source (1/2) (attempt #0) to container_1563272405568_0001_01_000002 @ 
e011239174096.et15sqa (dataPort=42072)
2019-07-16 18:20:34,404 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source 
(2/2) (fc3d9d65a75eabaf00d7d9372d2b9884) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,405 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Source: 
Custom Source (2/2) (attempt #0) to container_1563272405568_0001_01_000003 @ 
e011239174096.et15sqa (dataPort=41793)
2019-07-16 18:20:34,405 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (1/2) 
(65db57ac7166e0a96a3c5318bb262fb0) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,414 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Sink: 
Unnamed (1/2) (attempt #0) to container_1563272405568_0001_01_000003 @ 
e011239174096.et15sqa (dataPort=41793)
2019-07-16 18:20:34,447 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (2/2) 
(22c3e0c0fd37dd00e75fcf855e2a6ca4) switched from SCHEDULED to DEPLOYING.
2019-07-16 18:20:34,447 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Deploying Sink: 
Unnamed (2/2) (attempt #0) to container_1563272405568_0001_01_000002 @ 
e011239174096.et15sqa (dataPort=42072)
2019-07-16 18:20:34,897 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source 
(1/2) (e13567c7f2d7a389c74f4583a67e34e8) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:34,949 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source 
(2/2) (fc3d9d65a75eabaf00d7d9372d2b9884) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:35,056 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (1/2) 
(65db57ac7166e0a96a3c5318bb262fb0) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:35,067 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (2/2) 
(22c3e0c0fd37dd00e75fcf855e2a6ca4) switched from DEPLOYING to RUNNING.
2019-07-16 18:20:35,450 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source 
(2/2) (fc3d9d65a75eabaf00d7d9372d2b9884) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,480 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Source: Custom Source 
(1/2) (e13567c7f2d7a389c74f4583a67e34e8) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,494 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (2/2) 
(22c3e0c0fd37dd00e75fcf855e2a6ca4) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,508 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Sink: Unnamed (1/2) 
(65db57ac7166e0a96a3c5318bb262fb0) switched from RUNNING to FINISHED.
2019-07-16 18:20:35,513 INFO 
org.apache.flink.runtime.executiongraph.ExecutionGraph - Job Flink Streaming 
Job (2f9313ea4fd33bef68111ed380a2ae1b) switched from state RUNNING to FINISHED.
2019-07-16 18:20:35,513 INFO 
org.apache.flink.runtime.checkpoint.CheckpointCoordinator - Stopping checkpoint 
coordinator for job 2f9313ea4fd33bef68111ed380a2ae1b.
2019-07-16 18:20:35,513 INFO 
org.apache.flink.runtime.checkpoint.StandaloneCompletedCheckpointStore - 
Shutting down
2019-07-16 18:20:35,564 INFO org.apache.flink.runtime.dispatcher.MiniDispatcher 
- Job 2f9313ea4fd33bef68111ed380a2ae1b reached globally terminal state FINISHED.
2019-07-16 18:20:35,573 INFO org.apache.flink.runtime.jobmaster.JobMaster - 
Stopping the JobMaster for job Flink Streaming 
Job(2f9313ea4fd33bef68111ed380a2ae1b).
2019-07-16 18:20:35,664 INFO 
org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl - Suspending SlotPool.
2019-07-16 18:20:35,666 INFO org.apache.flink.runtime.jobmaster.JobMaster - 
Close ResourceManager connection 165d22977dc31b3b410489789fdc1050: JobManager 
is shutting down..
2019-07-16 18:20:35,668 INFO 
org.apache.flink.runtime.jobmaster.slotpool.SlotPoolImpl - Stopping SlotPool.
2019-07-16 18:20:35,668 INFO org.apache.flink.yarn.YarnResourceManager - 
Disconnect job manager 
00000000000000000000000000000...@akka.tcp://flink@e011239174096.et15sqa:50088/user/jobmanager_0
 for job 2f9313ea4fd33bef68111ed380a2ae1b from the resource manager.
2019-07-16 18:20:35,681 INFO 
org.apache.flink.runtime.jobmaster.JobManagerRunner - JobManagerRunner already 
shutdown.
2019-07-16 18:20:36,844 INFO 
org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Shutting 
YarnJobClusterEntrypoint down with application status SUCCEEDED. Diagnostics 
null.
2019-07-16 18:20:36,844 INFO 
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint - Shutting down 
rest endpoint.
2019-07-16 18:20:36,889 INFO 
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint - Removing cache 
directory /tmp/flink-web-0b120d31-7130-4fcb-bfa8-699abdd5b81e/flink-web-ui
2019-07-16 18:20:36,890 INFO 
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint - 
http://e011239174096.et15sqa:38342 lost leadership
2019-07-16 18:20:36,892 INFO 
org.apache.flink.runtime.jobmaster.MiniDispatcherRestEndpoint - Shut down 
complete.
2019-07-16 18:20:36,895 INFO org.apache.flink.yarn.YarnResourceManager - Shut 
down cluster because application is in SUCCEEDED, diagnostics null.
2019-07-16 18:20:36,896 INFO org.apache.flink.yarn.YarnResourceManager - 
{color:#FF0000}Unregister application from the YARN Resource Manager with final 
status SUCCEEDED.{color}
2019-07-16 18:20:36,981 INFO 
org.apache.hadoop.yarn.client.api.impl.AMRMClientImpl - Waiting for application 
to be successfully unregistered.

> YARNITCase stalls on travis
> ---------------------------
>
>                 Key: FLINK-12038
>                 URL: https://issues.apache.org/jira/browse/FLINK-12038
>             Project: Flink
>          Issue Type: Bug
>          Components: Deployment / YARN, Tests
>    Affects Versions: 1.9.0
>            Reporter: Chesnay Schepler
>            Assignee: shuai.xu
>            Priority: Critical
>              Labels: test-stability
>             Fix For: 1.9.0
>
>
> https://travis-ci.org/apache/flink/jobs/511932978



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to