Xuebin Su has uploaded this change for review. ( http://gerrit.cloudera.org:8080/23779 )
Change subject: IMPALA-14619: Reset levels_readahead_ for late materialization
......................................................................

IMPALA-14619: Reset levels_readahead_ for late materialization

Previously, `BaseScalarColumnReader::levels_readahead_` was not reset
when the reader did not do page filtering. If a query selected the last
row containing a collection value in a row group, `levels_readahead_`
would be set and would not be reset when advancing to the next row group
without page filtering. As a result, trying to skip collection values at
the start of the next row group would cause a check failure.

This patch fixes the failure by resetting `levels_readahead_` in
`BaseScalarColumnReader::Reset()`, which is always called when advancing
to the next row group.

`levels_readahead_` is also moved out of the "Members used for page
filtering" section as the variable is also used in late materialization.

Testing:
- Added an E2E test for the fix.

Change-Id: Idac138ffe4e1a9260f9080a97a1090b467781d00
---
M be/src/exec/parquet/parquet-column-readers.cc
M be/src/exec/parquet/parquet-column-readers.h
M testdata/workloads/functional-query/queries/QueryTest/parquet-late-materialization-unique-db.test
M tests/query_test/test_parquet_late_materialization.py
4 files changed, 29 insertions(+), 13 deletions(-)

  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/79/23779/2

--
To view, visit http://gerrit.cloudera.org:8080/23779
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Idac138ffe4e1a9260f9080a97a1090b467781d00
Gerrit-Change-Number: 23779
Gerrit-PatchSet: 2
Gerrit-Owner: Xuebin Su <[email protected]>
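
For readers unfamiliar with the failure mode in the commit message, the following is a toy model of it, not the actual Impala code. Only the names `levels_readahead_` and `Reset()` come from the change description; `ToyColumnReader`, its other members, and `SkipSucceedsInNextRowGroup` are hypothetical, and the real reader's state and check are assumed to be considerably more involved.

```cpp
#include <cassert>

// Toy stand-in for a column reader. This is NOT BaseScalarColumnReader;
// it only models one flag surviving (or not surviving) a row group change.
struct ToyColumnReader {
  // Name taken from the commit message; semantics simplified to "set or not".
  bool levels_readahead_ = false;
  // Hypothetical switch: true models the patched Reset(), false the old one.
  bool reset_clears_readahead_;

  explicit ToyColumnReader(bool reset_clears_readahead)
      : reset_clears_readahead_(reset_clears_readahead) {}

  // Hypothetical: models selecting the last row that contains a collection
  // value in a row group, which leaves the flag set.
  void ReadLastCollectionRow() { levels_readahead_ = true; }

  // Called when advancing to the next row group. In this model, the old
  // behaviour leaves the flag untouched and the patched one clears it.
  void Reset() {
    if (reset_clears_readahead_) levels_readahead_ = false;
  }

  // Hypothetical stand-in for the check that failed: skipping at the start
  // of a row group is assumed to require a cleared flag.
  bool SkipAtRowGroupStartSucceeds() const { return !levels_readahead_; }
};

// Runs the scenario from the commit message: read the last collection row,
// advance to the next row group, then try to skip collection values.
bool SkipSucceedsInNextRowGroup(bool reset_clears_readahead) {
  ToyColumnReader reader(reset_clears_readahead);
  reader.ReadLastCollectionRow();
  reader.Reset();
  return reader.SkipAtRowGroupStartSucceeds();
}
```

In this model, `SkipSucceedsInNextRowGroup(false)` represents the state before the patch (stale flag, skip rejected) and `SkipSucceedsInNextRowGroup(true)` the state after it.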
