Peter Rozsa created IMPALA-13932:
------------------------------------

             Summary: MERGE duplicate check reports false-positive if the 
incoming row batch's memory is reused
                 Key: IMPALA-13932
                 URL: https://issues.apache.org/jira/browse/IMPALA-13932
             Project: IMPALA
          Issue Type: Bug
          Components: Backend
    Affects Versions: Impala 4.5.0
            Reporter: Peter Rozsa
            Assignee: Peter Rozsa


The Iceberg merge node uses a duplicate check mechanism that compares the 
actual target row's pointer with the previous target row's pointer. If a new 
row batch's first target table's tuple points to the same region as the 
previous row, then we report a duplicate row erroneously.

The duplicate check should be aware whether the merge join's probe batch is 
ended, and this case, resetting the incoming row batch would solve the problem.

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to