[ https://issues.apache.org/jira/browse/HIVE-19008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16531857#comment-16531857 ]
Sahil Takiar commented on HIVE-19008:
-------------------------------------

[~aihuaxu] could you take a look?

The high-level idea is that Hive on Spark has a Hive Session ID and a Spark Session ID. I've changed the code so that the Spark Session ID is just a counter that is reset for each Hive Session. Previously, the Spark Session ID was a random UUID, which I don't think is very informative. Furthermore, the Spark Web UI now shows the Hive Session ID rather than the Spark Session ID, which I think is more helpful because it makes it easier to associate entries in the Spark Web UI with a Hive session.

> Improve Spark session id logging
> --------------------------------
>
> Key: HIVE-19008
> URL: https://issues.apache.org/jira/browse/HIVE-19008
> Project: Hive
> Issue Type: Sub-task
> Components: Spark
> Reporter: Sahil Takiar
> Assignee: Sahil Takiar
> Priority: Major
> Attachments: HIVE-19008.1.patch, HIVE-19008.2.patch
>
>
> HoS users have two session ids, one for the Hive session and another
> for the Spark session; both are UUIDs.
> I think some improvements could be made here:
> The Spark session id could just be a counter that is incremented for each new
> Spark session within a Hive session. Each Spark session is still globally
> identifiable by its associated Hive session id + its own counter. This may
> make more sense since the Hive session - Spark session has a 1-to-many
> relationship, as in a single Hive session can contain multiple Spark
> sessions, and each Spark session must belong to a Hive session.
> Furthermore, we should include both the Hive session id and Spark session id
> in the console logs + the Spark Web UI.
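
For illustration only, here is a minimal sketch of the counter scheme described in the issue. It is not taken from the attached patches: the class `SparkSessionIdSketch`, the nested `HiveSessionIds` type, its method names, the choice to start the counter at 1, and the `_` separator are all assumptions made for this example.

```java
import java.util.UUID;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch; these are NOT Hive's actual classes or method names.
public class SparkSessionIdSketch {

    /** Stands in for one Hive session and owns the counter for its Spark sessions. */
    static final class HiveSessionIds {
        private final String hiveSessionId;
        // One counter per Hive session, so numbering restarts for every new Hive session.
        private final AtomicInteger sparkSessionCounter = new AtomicInteger(0);

        HiveSessionIds(String hiveSessionId) {
            this.hiveSessionId = hiveSessionId;
        }

        /** Returns the next Spark session id within this Hive session (assumed to start at 1). */
        String nextSparkSessionId() {
            return Integer.toString(sparkSessionCounter.incrementAndGet());
        }

        /** Combines the Hive session id and the counter value into a globally unique label. */
        String qualifiedId(String sparkSessionId) {
            // The "_" separator is an assumption made for this sketch.
            return hiveSessionId + "_" + sparkSessionId;
        }
    }

    public static void main(String[] args) {
        HiveSessionIds first = new HiveSessionIds(UUID.randomUUID().toString());
        HiveSessionIds second = new HiveSessionIds(UUID.randomUUID().toString());

        // Two Spark sessions inside the first Hive session get counter values 1 and 2.
        System.out.println(first.qualifiedId(first.nextSparkSessionId()));
        System.out.println(first.qualifiedId(first.nextSparkSessionId()));

        // A different Hive session starts counting again from 1.
        System.out.println(second.qualifiedId(second.nextSparkSessionId()));
    }
}
```

The counter value alone is only unique within its Hive session, which is why the sketch pairs it with the Hive session id whenever a globally unique label is needed, for example in a log line.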