Oliver Draese created HIVE-22117:
------------------------------------

             Summary: Clean up RuntimeException code in AMReporter
                 Key: HIVE-22117
                 URL: https://issues.apache.org/jira/browse/HIVE-22117
             Project: Hive
          Issue Type: Bug
          Components: llap
    Affects Versions: 3.1.1
            Reporter: Oliver Draese
            Assignee: Oliver Draese


The AMReporter of LLAP throws RuntimExceptions from within addTaskAttempt and 
removeTaskAttempt. These can cause LLAP to come down.

As an interims fix (see HIVE-22113), the RuntimeException of removeTaskAttemp 
is caught from within TaskRunnerCallable, preventing LLAP termination if a 
killed task is not found in AMReporter.

Ideally, we would just log this on removeTask (a gone task is a gone task) and 
have a checked exception in addTaskAttempt. If the checkedException is caught, 
we should fail the task attempt (as there is already an attempt with this ID 
running).



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to