[ https://issues.apache.org/jira/browse/HIVE-23018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17071220#comment-17071220 ]
Vihang Karajgaonkar commented on HIVE-23018:
--------------------------------------------

Thanks for taking a look, [~ychena]. The boolean {{isSuccessful}} is a field in the top-level request object, so I thought it could be shared by all the insert events in the request. Currently, I don't see any usage in Hive (or in Spark, which reuses the Hive.java code) that sends a request with the successful flag set to false. In the case of Impala, too, we don't see a need to fire an event for an insert that was not successful in the first place.

In any case, if a client wishes to make a bulk call that fires a mix of successful and unsuccessful insert events, it can make two RPC calls: one for the successful batch and another for the unsuccessful batch. I don't see a reason to duplicate the boolean flag for every insert event in the request, which would unnecessarily increase the payload of the call. What do you think?

> Provide a bulk API to fire multiple insert events
> -------------------------------------------------
>
>                 Key: HIVE-23018
>                 URL: https://issues.apache.org/jira/browse/HIVE-23018
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Vihang Karajgaonkar
>            Assignee: Vihang Karajgaonkar
>            Priority: Major
>         Attachments: HIVE-23018.01.patch, HIVE-23018.02.patch, HIVE-23018.03.patch, HIVE-23018.04.patch
>
>
> Metastore provides an API to fire a listener event (currently only the INSERT event is supported). The problem with that API is that it takes only one partition at a time. A typical query may insert data into multiple partitions at a time. In such a case, query engines like HS2 or Impala have to issue multiple RPCs to the metastore sequentially to fire these events. This can show up as a slowdown to the user if the query engine does not return the prompt until all the events are fired (as is the case for HS2 and Impala).
> It would be great if we had a bulk API which takes in multiple partitions for a table so that the metastore can generate many such events in one RPC.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
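To illustrate the two-call approach suggested in the comment above, here is a minimal client-side sketch. It assumes a bulk request that carries a single success flag for all of its events. The types {{InsertEvent}} and {{BulkFireRequest}} and the method {{toRequests}} are hypothetical names invented for this illustration; they are not the metastore's Thrift classes and not the API proposed in the attached patches.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class BulkInsertEventSketch {

    /** Hypothetical per-partition insert outcome, as known to the client. */
    static final class InsertEvent {
        final String partition;
        final boolean succeeded;

        InsertEvent(String partition, boolean succeeded) {
            this.partition = partition;
            this.succeeded = succeeded;
        }
    }

    /** Hypothetical bulk request: one flag shared by every event in the batch. */
    static final class BulkFireRequest {
        final boolean successful;
        final List<String> partitions;

        BulkFireRequest(boolean successful, List<String> partitions) {
            this.successful = successful;
            this.partitions = partitions;
        }
    }

    /** Splits a mixed list of events into at most two requests, one per outcome. */
    static List<BulkFireRequest> toRequests(List<InsertEvent> events) {
        List<String> ok = new ArrayList<>();
        List<String> failed = new ArrayList<>();
        for (InsertEvent e : events) {
            (e.succeeded ? ok : failed).add(e.partition);
        }
        List<BulkFireRequest> requests = new ArrayList<>();
        // An empty batch is skipped, so the common all-successful case stays at one RPC.
        if (!ok.isEmpty()) {
            requests.add(new BulkFireRequest(true, ok));
        }
        if (!failed.isEmpty()) {
            requests.add(new BulkFireRequest(false, failed));
        }
        return requests;
    }

    public static void main(String[] args) {
        List<InsertEvent> events = Arrays.asList(
                new InsertEvent("ds=2020-03-01", true),
                new InsertEvent("ds=2020-03-02", false),
                new InsertEvent("ds=2020-03-03", true));
        for (BulkFireRequest r : toRequests(events)) {
            System.out.println("successful=" + r.successful + " partitions=" + r.partitions);
        }
    }
}
```

With this shape the flag is sent once per request rather than once per event, and a client with mixed outcomes pays for at most one additional RPC.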