[ https://issues.apache.org/jira/browse/HIVE-15573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matt McCline updated HIVE-15573: -------------------------------- Status: Patch Available (was: In Progress) NOTE: In order to compute the bucket/partition hash code, there are cases where VectorExtractRow has to set Hive writable objects so the Java hash code can be obtained. So, the enhancement will probably not be as fast as the UNIFORM hash path. > Vectorization: ACID shuffle ReduceSink is not specialized > ---------------------------------------------------------- > > Key: HIVE-15573 > URL: https://issues.apache.org/jira/browse/HIVE-15573 > Project: Hive > Issue Type: Improvement > Components: Transactions, Vectorization > Affects Versions: 2.2.0 > Reporter: Gopal V > Assignee: Matt McCline > Fix For: 2.2.0 > > Attachments: HIVE-15573.01.patch, HIVE-15573.02.patch, > screenshot-1.png > > > The ACID shuffle disabled murmur hash for the shuffle, due to the bucketing > requirements demanding the writable hashcode for the shuffles. > {code} > boolean useUniformHash = desc.getReducerTraits().contains(UNIFORM); > if (!useUniformHash) { > return false; > } > {code} > This check protects the fast ReduceSink ops from being used in ACID inserts. > A specialized case for the following pattern will make ACID insert much > faster. > {code} > Reduce Output Operator > sort order: > Map-reduce partition columns: _col0 (type: bigint) > value expressions: .... > {code} > !screenshot-1.png! -- This message was sent by Atlassian JIRA (v6.3.4#6332)