[ https://issues.apache.org/jira/browse/FLINK-9196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16457508#comment-16457508 ]
ASF GitHub Bot commented on FLINK-9196: --------------------------------------- GitHub user GJL opened a pull request: https://github.com/apache/flink/pull/5938 [FLINK-9196][flip6, yarn] Cleanup application files when deregistering YARN AM ## What is the purpose of the change *Ensure that YARN application files are removed if cluster is shutdown.* cc: @StephanEwen @tillrohrmann ## Brief change log - *Enable graceful cluster shut down via HTTP.* - *Remove Flink application files from remote file system when the YarnResourceManager deregisters the YARN ApplicationMaster. ## Verifying this change This change added tests and can be verified as follows: - *Manually verified that files are removed from HDFS when running stream (attached/detached) and batch jobs (attached).* - *Manually verified that files are removed from HDFS when running stopping a yarn session gracefully.* ## Does this pull request potentially affect one of the following parts: - Dependencies (does it add or upgrade a dependency): (yes / **no**) - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: (yes / **no**) - The serializers: (yes / **no** / don't know) - The runtime per-record code paths (performance sensitive): (yes / **no** / don't know) - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: (**yes** / no / don't know) - The S3 file system connector: (yes / **no** / don't know) ## Documentation - Does this pull request introduce a new feature? (yes / **no**) - If yes, how is the feature documented? (**not applicable** / docs / JavaDocs / not documented) You can merge this pull request into a Git repository by running: $ git pull https://github.com/GJL/flink FLINK-9196 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/flink/pull/5938.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #5938 ---- commit 6f0c0aed8a5b54814ed2e0fa761f06317592e4b3 Author: gyao <gary@...> Date: 2018-04-19T08:29:43Z [hotfix] Replace String concatenation with Slf4j placeholders. commit 34b5b40fec62502579a3f3804839c1e9d1e95952 Author: gyao <gary@...> Date: 2018-04-19T09:03:20Z [hotfix] Indent method parameters. commit bcb0f24ec587c15287c6144d1c088a5327d98c6d Author: gyao <gary@...> Date: 2018-04-19T09:04:27Z [hotfix] Remove unnecessary int cast. commit 264b3e664fe84583ab8e372824f6d4424627e6e1 Author: gyao <gary@...> Date: 2018-04-19T09:05:05Z [hotfix] Fix raw types warning. commit 1b6eb96b3d287a20ea86606fd01b5e10564c3f5d Author: gyao <gary@...> Date: 2018-04-19T09:18:32Z [hotfix][tests] Rename UtilsTest to YarnFlinkResourceManagerTest. Test was misnamed. commit e8d43ff72a2861713db934fe42163fac6d9ecb8d Author: gyao <gary@...> Date: 2018-04-26T15:38:20Z [hotfix][mesos] Delete unused class FlinkMesosSessionCli. commit a4f9a5c6a44f08aa5f4a8dbbfb28a0bdb562b8c5 Author: gyao <gary@...> Date: 2018-04-26T15:44:56Z [hotfix][yarn] Remove unused field appReport in YarnClusterClient. commit 1260dfac974670f325b21d175e1e29064530bb53 Author: gyao <gary@...> Date: 2018-04-19T10:07:54Z [FLINK-9196][flip6, yarn] Cleanup application files when deregistering YARN AM Enable graceful cluster shut down via HTTP. Remove Flink application files from remote file system when the YarnResourceManager deregisters the YARN ApplicationMaster. ---- > YARN: Flink binaries are not deleted from HDFS after cluster shutdown > --------------------------------------------------------------------- > > Key: FLINK-9196 > URL: https://issues.apache.org/jira/browse/FLINK-9196 > Project: Flink > Issue Type: Bug > Components: YARN > Affects Versions: 1.5.0 > Reporter: Gary Yao > Assignee: Gary Yao > Priority: Blocker > Labels: flip-6 > Fix For: 1.5.0 > > Attachments: 0001-xxx.patch > > > When deploying on YARN in flip6 mode, the Flink binaries are not deleted from > HDFS after the cluster shuts down. > *Steps to reproduce* > # Submit job in YARN job mode, non-detached: > {noformat} HADOOP_CLASSPATH=`hadoop classpath` bin/flink run -m yarn-cluster > -yjm 2048 -ytm 2048 ./examples/streaming/WordCount.jar {noformat} > # Check contents of {{/user/hadoop/.flink/<application_id>}} on HDFS after > job is finished: > {noformat} > [hadoop@ip-172-31-43-78 flink-1.5.0]$ hdfs dfs -ls > /user/hadoop/.flink/application_1523966184826_0016 > Found 6 items > -rw-r--r-- 1 hadoop hadoop 583 2018-04-17 14:54 > /user/hadoop/.flink/application_1523966184826_0016/90cf5b3a-039e-4d52-8266-4e9563d74827-taskmanager-conf.yaml > -rw-r--r-- 1 hadoop hadoop 332 2018-04-17 14:54 > /user/hadoop/.flink/application_1523966184826_0016/application_1523966184826_0016-flink-conf.yaml3818971235442577934.tmp > -rw-r--r-- 1 hadoop hadoop 89779342 2018-04-02 17:08 > /user/hadoop/.flink/application_1523966184826_0016/flink-dist_2.11-1.5.0.jar > drwxrwxrwx - hadoop hadoop 0 2018-04-17 14:54 > /user/hadoop/.flink/application_1523966184826_0016/lib > -rw-r--r-- 1 hadoop hadoop 1939 2018-04-02 15:37 > /user/hadoop/.flink/application_1523966184826_0016/log4j.properties > -rw-r--r-- 1 hadoop hadoop 2331 2018-04-02 15:37 > /user/hadoop/.flink/application_1523966184826_0016/logback.xml > {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)