[ https://issues.apache.org/jira/browse/HIVE-24761?focusedWorklogId=593911&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-593911 ]
ASF GitHub Bot logged work on HIVE-24761: ----------------------------------------- Author: ASF GitHub Bot Created on: 10/May/21 11:10 Start Date: 10/May/21 11:10 Worklog Time Spent: 10m Work Description: abstractdog commented on a change in pull request #2099: URL: https://github.com/apache/hive/pull/2099#discussion_r629269903 ########## File path: ql/src/gen/vectorization/ExpressionTemplates/ColumnArithmeticColumn.txt ########## @@ -34,20 +34,17 @@ public class <ClassName> extends VectorExpression { private static final long serialVersionUID = 1L; - private final int colNum1; private final int colNum2; Review comment: a) regarding input columns, a simple solution is could be https://github.com/apache/hive/pull/2099/commits/5db4c60aec34df291a1d1d85be319c865adcd1ad created convenience constructors, subclasses should be refactored accordingly now with care b) I was also wondering about having an array of inputcolumns as: ``` public int[] inputColumnNums = new int[] { -1, -1, -1 }; ``` this could be more general, more easily extendable, and needs a bit more refactoring (every existing occurrence of inputColumn should become inputColumnNums[0]) so regarding the class naming, we won't change anything (to unary, binary, ...) just use the proper constructor in subclasses -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 593911) Time Spent: 2h 20m (was: 2h 10m) > Vectorization: Support PTF - bounded start windows > -------------------------------------------------- > > Key: HIVE-24761 > URL: https://issues.apache.org/jira/browse/HIVE-24761 > Project: Hive > Issue Type: Sub-task > Reporter: László Bodor > Assignee: László Bodor > Priority: Major > Labels: pull-request-available > Time Spent: 2h 20m > Remaining Estimate: 0h > > {code} > notVectorizedReason: PTF operator: *** only UNBOUNDED start frame is > supported > {code} > Currently, bounded windows are not supported in VectorPTFOperator. If we > simply remove the check compile-time: > https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/Vectorizer.java#L2911 > {code} > if (!windowFrameDef.isStartUnbounded()) { > setOperatorIssue(functionName + " only UNBOUNDED start frame is > supported"); > return false; > } > {code} > We get incorrect results, that's because vectorized codepath completely > ignores boundaries, and simply iterates through all the input batches in > [VectorPTFGroupBatches|https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/exec/vector/ptf/VectorPTFGroupBatches.java#L172]: > {code} > for (VectorPTFEvaluatorBase evaluator : evaluators) { > evaluator.evaluateGroupBatch(batch); > if (isLastGroupBatch) { > evaluator.doLastBatchWork(); > } > } > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)