[ 
https://issues.apache.org/jira/browse/IMPALA-14051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956456#comment-17956456
 ] 

ASF subversion and git services commented on IMPALA-14051:
----------------------------------------------------------

Commit f545a0828df9874881a224b49575e0aebb195dab in impala's branch 
refs/heads/master from Sai Hemanth Gantasala
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=f545a0828 ]

IMPALA-14051: Use batch insert HMS API to fire insert events

Firing insert statements from Impala can be improved by leveraging the
batch insert HMS API addWriteNotificationLogInBatch() available (via
HIVE-25205) starting from Hive 4. Right now, Impala is using a single
call for each partition, so firing insert statements on a huge
partitioned table can become a performance bottleneck.

This patch addresses the above concern by leveraging
addWriteNotificationLogInBatch() API to fire batch insert partition
events.

Note: This optimization is applicable only for transactional partitioned
tables. The external partitioned table case is already handled.

Change-Id: Iff4a2691631fe9e9e1dc07714c69580a4ace6a8b
Reviewed-on: http://gerrit.cloudera.org:8080/22869
Tested-by: Impala Public Jenkins <[email protected]>
Reviewed-by: Riza Suminto <[email protected]>


> Use batch insert HMS API to fire insert events from Impala
> ----------------------------------------------------------
>
>                 Key: IMPALA-14051
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14051
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Catalog
>            Reporter: Sai Hemanth Gantasala
>            Assignee: Sai Hemanth Gantasala
>            Priority: Major
>
> Firing insert statements from Impala can be improved by leveraging the batch 
> insert HMS API. Right now, Impala is using a single call for each partition, 
> so firing insert statements on a huge partitioned table (look at logs below) 
> can become a performance bottleneck.
> {code:java}
> I0430 21:08:39.927217   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2023, Jan, 6]
> I0430 21:08:40.134413   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2023, Jan, 9]
> I0430 21:08:40.284446   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2023, Jan, 8]
> I0430 21:08:40.409381   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2023, Jan, 3]
> I0430 21:08:40.534400   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2023, Jan, 2]
> I0430 21:08:40.659636   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2023, Jan, 5]
> I0430 21:08:40.884773   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2014, Sep, 19]
> I0430 21:08:41.076419   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2014, Sep, 18]
> I0430 21:08:41.251446   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2023, Jan, 4]
> I0430 21:08:41.443357   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2014, Sep, 17]
> I0430 21:08:41.618523   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2019, Dec, 3]
> I0430 21:08:42.110314   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2014, Sep, 16]
> I0430 21:08:42.218703   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2019, Dec, 2]
> I0430 21:08:42.327061   826 MetastoreShim.java:433] 
> ed40d2f54233490f:5d56c42b00000000] Firing write notification log request for 
> table highpartition_5k_100_hive2.stocks_highpartition_1002 on partition 
> [2014, Sep, 15] {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to