[ 
https://issues.apache.org/jira/browse/HIVE-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14066093#comment-14066093
 ] 

Matt McCline commented on HIVE-7421:
------------------------------------

Here is the explain output for query 47 with SPECIAL annotation showing the 
VectorExpression(s):

{code}
STAGE DEPENDENCIES:
  Stage-1 is a root stage
  Stage-0 depends on stages: Stage-1

STAGE PLANS:
  Stage: Stage-1
    Map Reduce
      Map Operator Tree:
          TableScan
            alias: staples
            Statistics: Num rows: 54860 Data size: 158216240 Basic stats: 
COMPLETE Column stats: NONE
            Filter Operator
              predicate: (((concat(to_date(order_date_), ' 00:00:00') = 
'1997-01-01 00:00:00') or (concat(to_date(order_date_), ' 00:00:00') = 
'1997-01-03 00:00:00')) and ((to_date(order_date_) = '1997-01-01') or 
(to_date(order_date_) = '1997-01-03'))) (type: boolean)
              Statistics: Num rows: 54860 Data size: 158216240 Basic stats: 
COMPLETE Column stats: NONE
              vector filter expressions: 
FilterExprAndExpr[-1](FilterExprOrExpr[-1](FilterStringColEqualStringScalar[-1](StringConcatColScalar[51](VectorUDFDateString[50]))
 
FilterStringColEqualStringScalar[-1](StringConcatColScalar[51](VectorUDFDateString[50])))
 
FilterExprOrExpr[-1](FilterStringColEqualStringScalar[-1](VectorUDFDateString[50])
 FilterStringColEqualStringScalar[-1](VectorUDFDateString[50])))
              Select Operator
                expressions: order_priority (type: string)
                outputColumnNames: order_priority
                Statistics: Num rows: 54860 Data size: 158216240 Basic stats: 
COMPLETE Column stats: NONE
                vector select expressions: IdentityExpression[2]
                Group By Operator
                  keys: order_priority (type: string)
                  mode: hash
                  outputColumnNames: _col0
                  Statistics: Num rows: 54860 Data size: 158216240 Basic stats: 
COMPLETE Column stats: NONE
                  Reduce Output Operator
                    key expressions: _col0 (type: string)
                    sort order: +
                    Map-reduce partition columns: _col0 (type: string)
                    Statistics: Num rows: 54860 Data size: 158216240 Basic 
stats: COMPLETE Column stats: NONE
      Execution mode: vectorized
      Reduce Operator Tree:
        Group By Operator
          keys: KEY._col0 (type: string)
          mode: mergepartial
          outputColumnNames: _col0
          Statistics: Num rows: 27430 Data size: 79108120 Basic stats: COMPLETE 
Column stats: NONE
          Select Operator
            expressions: _col0 (type: string)
            outputColumnNames: _col0
            Statistics: Num rows: 27430 Data size: 79108120 Basic stats: 
COMPLETE Column stats: NONE
            File Output Operator
              compressed: false
              Statistics: Num rows: 27430 Data size: 79108120 Basic stats: 
COMPLETE Column stats: NONE
              table:
                  input format: org.apache.hadoop.mapred.TextInputFormat
                  output format: 
org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
                  serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe

  Stage: Stage-0
    Fetch Operator
      limit: -1
      Processor Tree:
        ListSink
{code}

> Null pointer exception involving 
> ql.exec.vector.expressions.StringConcatColScalar.evaluate
> ------------------------------------------------------------------------------------------
>
>                 Key: HIVE-7421
>                 URL: https://issues.apache.org/jira/browse/HIVE-7421
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Matt McCline
>            Assignee: Matt McCline
>         Attachments: TestWithORC.zip, fail_47.sql, fail_62.sql, fail_932.sql
>
>
> One of several found by Raj Bains.
> M/R or Tez.
> {code}
> set hive.vectorized.execution.enabled=true;
> {code}
> Seems very similar to https://issues.apache.org/jira/browse/HIVE-6649
> Query:
> {code}
> SELECT FLOOR((7 + DATEDIFF(`Staples`.`order_date_`, 
> CONCAT(CAST(YEAR(`Staples`.`order_date_`) AS STRING), '-01-01 00:00:00'))  
> +pmod(8 + pmod(datediff(CONCAT(CAST(YEAR(`Staples`.`order_date_`) AS STRING), 
> '-01-01 00:00:00'), '1995-01-01'), 7) - 2, 7) ) / 7) AS `wk_order_date_ok`,   
> SUM(`Staples`.`sales_total`) AS `sum_sales_total_ok` FROM 
> `default`.`testv1_Staples` `Staples` GROUP BY FLOOR((7 + 
> DATEDIFF(`Staples`.`order_date_`, CONCAT(CAST(YEAR(`Staples`.`order_date_`) 
> AS STRING), '-01-01 00:00:00'))  +pmod(8 + 
> pmod(datediff(CONCAT(CAST(YEAR(`Staples`.`order_date_`) AS STRING), '-01-01 
> 00:00:00'), '1995-01-01'), 7) - 2, 7) ) / 7) ;
> {code}
> Stack trace:
> {code}
> Caused by: java.lang.NullPointerException
>       at java.lang.System.arraycopy(Native Method)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector.setConcat(BytesColumnVector.java:190)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.StringConcatColScalar.evaluate(StringConcatColScalar.java:78)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorUDFDateDiffColCol.evaluate(VectorUDFDateDiffColCol.java:59)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.gen.LongScalarAddLongColumn.evaluate(LongScalarAddLongColumn.java:65)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.gen.LongColAddLongColumn.evaluate(LongColAddLongColumn.java:52)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.LongColDivideLongScalar.evaluate(LongColDivideLongScalar.java:52)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FuncFloorDoubleToLong.evaluate(FuncFloorDoubleToLong.java:47)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.VectorHashKeyWrapperBatch.evaluateBatch(VectorHashKeyWrapperBatch.java:147)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator$ProcessingModeHashAggregate.processBatch(VectorGroupByOperator.java:289)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator.processOp(VectorGroupByOperator.java:711)
>       at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:800)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:139)
>       at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:800)
>       at 
> org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:95)
>       at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:800)
>       at 
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43)
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to