[ https://issues.apache.org/jira/browse/HIVE-21912?focusedWorklogId=271781&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-271781 ]
ASF GitHub Bot logged work on HIVE-21912: ----------------------------------------- Author: ASF GitHub Bot Created on: 03/Jul/19 17:45 Start Date: 03/Jul/19 17:45 Worklog Time Spent: 10m Work Description: odraese commented on pull request #698: HIVE-21912: Implement DisablingDaemonStatisticsHandler URL: https://github.com/apache/hive/pull/698#discussion_r300021605 ########## File path: common/src/java/org/apache/hadoop/hive/conf/HiveConf.java ########## @@ -4358,6 +4358,40 @@ private static void populateLlapDaemonVarsSet(Set<String> llapDaemonVarsSetLocal "The listener which is called when new Llap Daemon statistics is received on AM side.\n" + "The listener should implement the " + "org.apache.hadoop.hive.llap.tezplugins.metrics.LlapMetricsListener interface."), + LLAP_TASK_SCHEDULER_BLACKLISTING_METRICS_LISTENER_MIN_SERVED_TASKS( + "hive.llap.task.scheduler.blacklisting.metrics.listener.min.served.tasks", 2000, + "If the number of tasks served by a node is below this number then we will ignore the node\n" + Review comment: Maybe instead of formulating a negation here "if below x than we don't do something", we should simplify the statement to something like "minimum amount of executed tasks for limping node detection"... ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 271781) Time Spent: 2.5h (was: 2h 20m) > Implement BlacklistingLlapMetricsListener > ----------------------------------------- > > Key: HIVE-21912 > URL: https://issues.apache.org/jira/browse/HIVE-21912 > Project: Hive > Issue Type: Sub-task > Components: llap, Tez > Reporter: Peter Vary > Assignee: Peter Vary > Priority: Major > Labels: pull-request-available > Attachments: HIVE-21912.patch, HIVE-21912.wip-2.patch, > HIVE-21912.wip.patch > > Time Spent: 2.5h > Remaining Estimate: 0h > > We should implement a DaemonStatisticsHandler which: > * If a node average response time is bigger than 150% (configurable) of the > other nodes > * If the other nodes has enough empty executors to handle the requests > Then disables the limping node. -- This message was sent by Atlassian JIRA (v7.6.3#76005)