[ https://issues.apache.org/jira/browse/HIVE-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mostafa Mokhtar updated HIVE-10484: ----------------------------------- Summary: Vectorization : RuntimeException "Big Table Retained Mapping duplicate column" (was: Vectorization : Big Table Retained Mapping duplicate column) > Vectorization : RuntimeException "Big Table Retained Mapping duplicate column" > ------------------------------------------------------------------------------ > > Key: HIVE-10484 > URL: https://issues.apache.org/jira/browse/HIVE-10484 > Project: Hive > Issue Type: Bug > Components: Tez, Vectorization > Affects Versions: 1.2.0 > Reporter: Mostafa Mokhtar > Assignee: Matt McCline > Fix For: 1.2.0 > > > With vectorization and tez enabled TPC-DS Q70 fails with > {code} > Caused by: java.lang.RuntimeException: Big Table Retained Mapping duplicate > column 6 in ordered column map {6=(value column: 6, type name: int), > 21=(value column: 21, type name: float), 22=(value column: 22, type name: > int)} when adding value column 6, type int > at > org.apache.hadoop.hive.ql.exec.vector.VectorColumnOrderedMap.add(VectorColumnOrderedMap.java:97) > at > org.apache.hadoop.hive.ql.exec.vector.VectorColumnOutputMapping.add(VectorColumnOutputMapping.java:40) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinCommonOperator.determineCommonInfo(VectorMapJoinCommonOperator.java:320) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinCommonOperator.<init>(VectorMapJoinCommonOperator.java:254) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinGenerateResultOperator.<init>(VectorMapJoinGenerateResultOperator.java:89) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinInnerGenerateResultOperator.<init>(VectorMapJoinInnerGenerateResultOperator.java:97) > at > org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinInnerLongOperator.<init>(VectorMapJoinInnerLongOperator.java:79) > ... 49 more > {code} > Query > {code} > select s_state > from (select s_state as s_state, sum(ss_net_profit), > rank() over ( partition by s_state order by > sum(ss_net_profit) desc) as ranking > from store_sales, store, date_dim > where d_month_seq between 1193 and 1193+11 > and date_dim.d_date_sk = > store_sales.ss_sold_date_sk > and store.s_store_sk = store_sales.ss_store_sk > group by s_state > ) tmp1 > where ranking <= 5 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)