[ 
https://issues.apache.org/jira/browse/KUDU-3504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17756954#comment-17756954
 ] 

ASF subversion and git services commented on KUDU-3504:
-------------------------------------------------------

Commit fd98e9e7331a0e8fc6b091faa2d2744a7787e6d7 in kudu's branch 
refs/heads/master from Attila Bukor
[ https://gitbox.apache.org/repos/asf?p=kudu.git;h=fd98e9e73 ]

KUDU-3504 Crash master on subprocess death

In the past, there were several instances when the Ranger subprocess
crashed.  In these cases, the master happily went on, but failed to
authorize requests. Since there's no way to restart the subprocess
without restarting the master anyway, it's better to crash the master as
well to make sure the failure of the subprocess is detected in time and
can be addressed.

As there are multiple concurrent calls to Subprocess::DoWait() now, this
commit also changes some member variables to atomic to make sure it's
thread-safe as TSAN complained about a data race.

Change-Id: Iec516f3d684f152bd29874b60b810c526ee5a184
Reviewed-on: http://gerrit.cloudera.org:8080/20365
Tested-by: Kudu Jenkins
Reviewed-by: Marton Greber <greber...@gmail.com>
Reviewed-by: Alexey Serbin <ale...@apache.org>


> Crash master on subprocess failure
> ----------------------------------
>
>                 Key: KUDU-3504
>                 URL: https://issues.apache.org/jira/browse/KUDU-3504
>             Project: Kudu
>          Issue Type: Improvement
>            Reporter: Attila Bukor
>            Assignee: Attila Bukor
>            Priority: Major
>
> If the Ranger subprocess crashes, authorization will fail, but there's no 
> other indication that the process has died. The master won't restart the 
> subprocess and there's no way to restart it manually without restarting Kudu 
> anyway. Furthermore, if the subprocess crashes, that's likely a symptom of 
> something that requires manual intervention to resolve to avoid crashing in 
> the future, so it's best if the master crashes upon the subprocess crashing 
> as well.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to