[jira] [Commented] (HIVE-14199) Enable Bucket Pruning for ACID tables

Gopal V (JIRA) Fri, 12 Aug 2016 17:21:40 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-14199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15419717#comment-15419717
 ]


Gopal V commented on HIVE-14199:
--------------------------------

[~ekoifman]: The acid_bucket_pruning.q does not have vectorization enabled.

> Enable Bucket Pruning for ACID tables
> -------------------------------------
>
>                 Key: HIVE-14199
>                 URL: https://issues.apache.org/jira/browse/HIVE-14199
>             Project: Hive
>          Issue Type: Improvement
>          Components: Transactions
>            Reporter: Saket Saurabh
>            Assignee: Saket Saurabh
>         Attachments: HIVE-14199.01.patch, HIVE-14199.02.patch, 
> HIVE-14199.03.patch
>
>
> Currently, ACID tables do not benefit from the bucket pruning feature 
> introduced in HIVE-11525. The reason for this has been the fact that bucket 
> pruning happens at split generation level and for ACID, traditionally the 
> delta files were never split. The parallelism for ACID was then restricted to 
> the number of buckets. There would be as many splits as the number of buckets 
> and each worker processing one split would inevitably read all the delta 
> files for that bucket, even when the query may have originally required only 
> one of the buckets to be read.
> However, HIVE-14035 now enables even the delta files to be also split. What 
> this means is that now we have enough information at the split generation 
> level to determine appropriate buckets to process for the delta files. This 
> can efficiently allow us to prune unnecessary buckets for delta files and 
> will lead to good performance gain for a large number of selective queries on 
> ACID tables.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-14199) Enable Bucket Pruning for ACID tables

Reply via email to