[ 
https://issues.apache.org/jira/browse/FLINK-10063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16574452#comment-16574452
 ] 

ASF GitHub Bot commented on FLINK-10063:
----------------------------------------

tillrohrmann commented on issue #6496: [FLINK-10063][tests] Use runit to 
supervise mesos processes.
URL: https://github.com/apache/flink/pull/6496#issuecomment-411672800
 
 
   Merging this PR.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Jepsen: Automatically restart Mesos Processes
> ---------------------------------------------
>
>                 Key: FLINK-10063
>                 URL: https://issues.apache.org/jira/browse/FLINK-10063
>             Project: Flink
>          Issue Type: Bug
>          Components: Tests
>    Affects Versions: 1.6.0
>            Reporter: Gary Yao
>            Assignee: Gary Yao
>            Priority: Critical
>              Labels: pull-request-available
>             Fix For: 1.6.1, 1.7.0
>
>
> Use a process supervisor to automatically restart Mesos processes. This is 
> needed because Mesos uses a "fail-fast" approach to error handling, e.g., the 
> Mesos master will exit when it discovers it has been partitioned away from 
> the Zookeeper quorum. Currently the some of the tests cannot pass because the 
> Mesos processes exiting.
> *Acceptance Criteria*
> * Running tests with {{--deployment-mode mesos-session}} should not fail due 
> to reasons related to the Mesos setup.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to