[ https://issues.apache.org/jira/browse/HIVE-24758?focusedWorklogId=550835&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-550835 ]
ASF GitHub Bot logged work on HIVE-24758: ----------------------------------------- Author: ASF GitHub Bot Created on: 10/Feb/21 18:40 Start Date: 10/Feb/21 18:40 Worklog Time Spent: 10m Work Description: belugabehr commented on a change in pull request #1963: URL: https://github.com/apache/hive/pull/1963#discussion_r573979578 ########## File path: ql/src/java/org/apache/hadoop/hive/ql/exec/tez/TezTask.java ########## @@ -236,6 +239,10 @@ public int execute() { throw new HiveException("Operation cancelled"); } + // Log all the info required to find the various logs for this query + LOG.info("HS2 Host: [{}], Query ID: [{}], Dag ID: [{}], DAG Session ID: [{}]", getHostNameIP(), queryId, Review comment: Hey @pgaref, thanks for the valuable input. So, on one hand it may seem confusing as to the need to log the host name because if you are looking at this log file then of course the hostname is already known. However, as I understand the code, these log messages are sent (redirected) to the client via Thrift RPC APIs. This is helpful information for debugging for the client as it's not otherwise clear which instance of HS2 is processing the query, if for example, a load balancer is between the client and HS2. I had thought about including some sort of HS2 unique ID as well, but I didn't find such a capability in the project already and did not want to introduce here. However, these 4 pieces of information allow a client to report a problem to the admin and allow the admin to grab all the log files: HS2 and YARN TEZ DAG logs. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 550835) Time Spent: 1h 10m (was: 1h) > Log Tez Task DAG ID, DAG Session ID, HS2 Hostname > ------------------------------------------------- > > Key: HIVE-24758 > URL: https://issues.apache.org/jira/browse/HIVE-24758 > Project: Hive > Issue Type: Improvement > Reporter: David Mollitor > Assignee: David Mollitor > Priority: Major > Labels: pull-request-available > Time Spent: 1h 10m > Remaining Estimate: 0h > > In order to get the logs for a particular query, submitted to Tez on YARN, > the following pieces of information are required: > * YARN Application ID > * TEZ DAG ID > * HS2 Host that ran the job > Include this information in TezTask output. -- This message was sent by Atlassian Jira (v8.3.4#803005)