cshuo opened a new issue, #14196:
URL: https://github.com/apache/hudi/issues/14196
### Task Description
**What needs to be done:**
The predicate currently pushed down to the base file reader in the Flink
FileGroup reader can produce wrong results in some cases, for example:
// Case:
// * the record in the base file does not satisfy the predicate
// * the record in the log file satisfies the predicate
// * the record in the base file has the higher ordering value
// E.g., base file: (uuid: 'k1', age: 23, ts: 1003)
//       log file: (uuid: 'k1', age: 25, ts: 1001)
//       query filter: age = 25
// The expected result is empty, because the base file record wins the merge
// and does not match the filter. But if the predicate age = 25 is pushed down
// into the parquet reader, the base file record is dropped before merging and
// the result is wrongly (uuid: 'k1', age: 25, ts: 1001).
When the file slice to read contains log files, we should make sure the
pushed-down predicate contains only primary key fields.
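The case above can be reproduced with a small standalone model. This is only a sketch of the merge semantics, not Hudi code: `Record`, `merge`, `read_with_pushdown`, and `read_filter_after_merge` are hypothetical names, and it assumes that the record with the higher `ts` wins the merge and that the query filter is applied again after merging.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List


@dataclass(frozen=True)
class Record:
    uuid: str  # primary key
    age: int
    ts: int    # ordering value (assumption: higher ts wins the merge)


Predicate = Callable[[Record], bool]


def merge(base: Iterable[Record], log: Iterable[Record]) -> List[Record]:
    """Keep, for each key, the record with the highest ordering value."""
    latest: Dict[str, Record] = {}
    for rec in list(base) + list(log):
        current = latest.get(rec.uuid)
        if current is None or rec.ts >= current.ts:
            latest[rec.uuid] = rec
    return list(latest.values())


def read_with_pushdown(base, log, pred: Predicate) -> List[Record]:
    # Problematic path: the predicate filters the base file before merging.
    merged = merge(filter(pred, base), log)
    return [r for r in merged if pred(r)]


def read_filter_after_merge(base, log, pred: Predicate) -> List[Record]:
    # Reference path: merge first, then apply the predicate.
    return [r for r in merge(base, log) if pred(r)]


base_file = [Record("k1", 23, 1003)]
log_file = [Record("k1", 25, 1001)]


def age_is_25(r: Record) -> bool:
    return r.age == 25


def key_is_k1(r: Record) -> bool:
    return r.uuid == "k1"


# Data-field predicate: pushdown returns the stale log record.
wrong = read_with_pushdown(base_file, log_file, age_is_25)
expected = read_filter_after_merge(base_file, log_file, age_is_25)

# Primary-key predicate: both paths agree.
key_pushdown = read_with_pushdown(base_file, log_file, key_is_k1)
key_reference = read_filter_after_merge(base_file, log_file, key_is_k1)
```

In this model a primary-key predicate is safe to push down because it keeps or drops every version of a key together, so it cannot change which version wins the merge. A predicate on a data field such as `age` can drop only the winning version and let an older one through.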
**Why this task is needed:**
### Task Type
Code improvement/refactoring
### Related Issues
**Parent feature issue:** (if applicable )
**Related issues:**