Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/22662 )

Change subject: IMPALA-3841: Enable late materialization for collections
......................................................................


Patch Set 7:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/22662/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/22662/2//COMMIT_MSG@13
PS2, Line 13: For a collection column, late materialization takes effect only 
when the
            : collection
> Agree, no need to do it here.
This can add some miner perf regression, right? I mean that in case there are 
many rows where the collection column is empty without the !empty() predicate 
the scanner won't drop the row and pass it to the next node, and only unnest 
node will drop it. I think that this should be noticeable only in very extreme 
cases but it could be noted in the commit message. An idea on how to avoid this 
in some cases is to keep the !empty() pred in case there is not other top level 
predicate (not sure how hard it is to add this in the planner).


http://gerrit.cloudera.org:8080/#/c/22662/7/be/src/exec/parquet/hdfs-parquet-scanner.cc
File be/src/exec/parquet/hdfs-parquet-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/22662/7/be/src/exec/parquet/hdfs-parquet-scanner.cc@2385
PS7, Line 2385: late_materialization_threshold_
I think that this is not the right place to calculate this, as this will be 
called many times - place called only during initialization would be better.



--
To view, visit http://gerrit.cloudera.org:8080/22662
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia21bdfa6811408d66d74367e0a9520e20951105f
Gerrit-Change-Number: 22662
Gerrit-PatchSet: 7
Gerrit-Owner: Xuebin Su <x...@cloudera.com>
Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Xuebin Su <x...@cloudera.com>
Gerrit-Reviewer: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Comment-Date: Wed, 30 Apr 2025 14:42:55 +0000
Gerrit-HasComments: Yes

Reply via email to