[ 
https://issues.apache.org/jira/browse/FLINK-4152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15384069#comment-15384069
 ] 

ASF GitHub Bot commented on FLINK-4152:
---------------------------------------

Github user tillrohrmann commented on the issue:

    https://github.com/apache/flink/pull/2257
  
    I think by imposing this contract we actually introduced problems we didn't 
have before. Thus, in order to remedy these problems for the upcoming release 
and go back a bit in the direction of the pre-RM age, I loosened the contract 
of the RM so that it can no longer reject TM registrations. This makes sense in 
my opinion, since we don't have a mean to shut down orphaned TMs anyway.
    
    So in this version, the RM's task is to ensure that at least a predefined 
set of resources is allocated and to notify the JM about a TM death (not 
strictly mandatory).
    
    What we could actually do is to also register orphaned TMs (or ones 
registered by a different RM). Then we wouldn't have the problem that we 
allocate too many resources for a JM.


> TaskManager registration exponential backoff doesn't work
> ---------------------------------------------------------
>
>                 Key: FLINK-4152
>                 URL: https://issues.apache.org/jira/browse/FLINK-4152
>             Project: Flink
>          Issue Type: Bug
>          Components: Distributed Coordination, TaskManager, YARN Client
>            Reporter: Robert Metzger
>            Assignee: Till Rohrmann
>         Attachments: logs.tgz
>
>
> While testing Flink 1.1 I've found that the TaskManagers are logging many 
> messages when registering at the JobManager.
> This is the log file: 
> https://gist.github.com/rmetzger/0cebe0419cdef4507b1e8a42e33ef294
> Its logging more than 3000 messages in less than a minute. I don't think that 
> this is the expected behavior.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to