[ https://issues.apache.org/jira/browse/FLINK-6434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16236173#comment-16236173 ]
ASF GitHub Bot commented on FLINK-6434: --------------------------------------- Github user tillrohrmann commented on a diff in the pull request: https://github.com/apache/flink/pull/4937#discussion_r148592528 --- Diff: flink-runtime/src/main/java/org/apache/flink/runtime/instance/SlotPool.java --- @@ -361,9 +374,19 @@ private void slotRequestToResourceManagerFailed(AllocationID allocationID, Throw } private void checkTimeoutSlotAllocation(AllocationID allocationID) { + removePendingRequestWithException(allocationID, new TimeoutException("Slot allocation request " + allocationID + " timed out")); + } + + private void removePendingRequestWithException(AllocationID allocationID, Exception e) { --- End diff -- maybe we could refactor this method into `failPendingRequest(PendingRequest, Exception)`, then it could be used by `checkTimeoutSlotAllocation` and `checkTimeoutRequestWaitingForResourceManager` > There may be allocatedSlots leak in SlotPool > -------------------------------------------- > > Key: FLINK-6434 > URL: https://issues.apache.org/jira/browse/FLINK-6434 > Project: Flink > Issue Type: Bug > Components: Cluster Management > Reporter: shuai.xu > Assignee: shuai.xu > Priority: Major > Labels: flip-6 > > If the call allocateSlot() from Execution to Slotpool timeout, the job will > begin to failover, but the pending request are still in SlotPool, if then a > new slot register to SlotPool, it may be fulfill the outdated pending request > and be added to allocatedSlots, but it will never be used and will never be > recycled. -- This message was sent by Atlassian JIRA (v6.4.14#64029)