[ https://issues.apache.org/jira/browse/FLINK-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16182016#comment-16182016 ]
ASF GitHub Bot commented on FLINK-7076:
---------------------------------------

GitHub user suez1224 opened a pull request:

    https://github.com/apache/flink/pull/4729

    [FLINK-7076] [ResourceManager] implement YARN stopWorker logic

    ## What is the purpose of the change

    *Implement stopWorker logic for YarnResourceManager*

    ## Brief change log

      - *Added a ConcurrentHashMap to keep the ResourceID to Yarn ContainerId mappings*
      - *Implement the stopWorker logic for YARN*

    ## Verifying this change

    *(Please pick either of the following options)*

    This change is a trivial rework / code cleanup without any test coverage.

    *(or)*

    This change is already covered by existing tests, such as *(please describe tests)*.

    *(or)*

    This change added tests and can be verified as follows:

    *(example:)*
      - *Added integration tests for end-to-end deployment with large payloads (100MB)*
      - *Extended integration test for recovery after master (JobManager) failure*
      - *Added test that validates that TaskInfo is transferred only once across recoveries*
      - *Manually verified the change by running a 4 node cluster with 2 JobManagers and 4 TaskManagers, a stateful streaming program, and killing one JobManager and two TaskManagers during the execution, verifying that recovery happens correctly.*

    ## Does this pull request potentially affect one of the following parts:

      - Dependencies (does it add or upgrade a dependency): no
      - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: no
      - The serializers: no
      - The runtime per-record code paths (performance sensitive): no
      - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: yes

    ## Documentation

      - Does this pull request introduce a new feature? no
      - If yes, how is the feature documented? not applicable

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/suez1224/flink implement-stopWorker-yarn

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/4729.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #4729

----
commit e82cb580956d3ee323e40f9c5335cf645b32b99d
Author: Shuyi Chen <sh...@uber.com>
Date:   2017-09-27T05:23:34Z

    implement YARN stopWorker logic

----

> Implement container release to support dynamic scaling
> ------------------------------------------------------
>
>                 Key: FLINK-7076
>                 URL: https://issues.apache.org/jira/browse/FLINK-7076
>             Project: Flink
>          Issue Type: Sub-task
>          Components: ResourceManager
>            Reporter: Till Rohrmann
>            Assignee: Shuyi Chen
>              Labels: flip-6
>
> In order to support dynamic scaling, the {{YarnResourceManager}} has to be
> able to dynamically free containers. We have to implement the
> {{YarnResourceManager#stopWorker}} method.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
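
For readers who want a concrete picture of the change log above, here is a minimal sketch of the bookkeeping it describes: a ConcurrentHashMap from resource ID to container ID, consulted when a worker is stopped. It is not the code from the pull request. `WorkerRegistry`, `ContainerReleaser`, `registerWorker`, and `trackedWorkers` are hypothetical names, and plain strings stand in for Flink's ResourceID and YARN's ContainerId so the example runs without either library. In a real resource manager the release step would presumably delegate to the YARN client (for example `AMRMClientAsync#releaseAssignedContainer`), which is an assumption here.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

public class StopWorkerSketch {

    /** Hypothetical stand-in for the YARN client call that hands a container back. */
    public interface ContainerReleaser {
        void release(String containerId);
    }

    /** Remembers which container backs each worker so it can be freed later. */
    public static final class WorkerRegistry {
        // Resource ID -> container ID; strings replace the real ID types (assumption).
        private final ConcurrentMap<String, String> resourceIdToContainerId =
                new ConcurrentHashMap<>();
        private final ContainerReleaser releaser;

        public WorkerRegistry(ContainerReleaser releaser) {
            this.releaser = releaser;
        }

        /** Records the mapping when a container is allocated for a worker. */
        public void registerWorker(String resourceId, String containerId) {
            resourceIdToContainerId.put(resourceId, containerId);
        }

        /**
         * Releases the container behind the given worker.
         * Returns false if the worker is unknown or was already stopped.
         */
        public boolean stopWorker(String resourceId) {
            // remove() is atomic, so two concurrent calls cannot both release.
            String containerId = resourceIdToContainerId.remove(resourceId);
            if (containerId == null) {
                return false;
            }
            releaser.release(containerId);
            return true;
        }

        public int trackedWorkers() {
            return resourceIdToContainerId.size();
        }
    }

    public static void main(String[] args) {
        List<String> released = new ArrayList<>();
        WorkerRegistry registry = new WorkerRegistry(released::add);

        registry.registerWorker("resource-1", "container_0001");
        System.out.println("first stop:  " + registry.stopWorker("resource-1"));
        System.out.println("second stop: " + registry.stopWorker("resource-1"));
        System.out.println("released:    " + released);
    }
}
```

Removing the entry before releasing makes a repeated stop request for the same worker a no-op, which seems a reasonable property for container release under dynamic scaling. Whether the actual patch behaves this way is not stated in this message.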