Prashant Wason created HUDI-3495:
------------------------------------
Summary: Reading keys in parallel from
HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists
Key: HUDI-3495
URL: https://issues.apache.org/jira/browse/HUDI-3495
Project: Apache Hudi
Issue Type: Bug
Reporter: Prashant Wason
[HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
has two functions that look up keys:
getRecordByKey(String key) - looks up the key in the member variable map "records"
getRecordsByKeys(List<String> keys) - clears the member variable map "records" and
scans the log files again.
If the two functions are called in parallel, getRecordByKey() may return an
empty result even though the key exists, because "records" was cleared by
another thread calling getRecordsByKeys().
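
The interleaving described above can be replayed deterministically with a small sketch. This is not the Hudi implementation: SketchReader, the in-memory "logFiles" map, and the afterClear hook are hypothetical stand-ins, and the hook only simulates a second thread running between the clear and the rescan.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

// Hypothetical stand-in for the reader; NOT the actual Hudi class.
class SketchReader {
    // Stand-in for the contents of the log files.
    private final Map<String, String> logFiles;
    // Shared member map, analogous to "records" in the issue.
    private final Map<String, String> records = new HashMap<>();

    SketchReader(Map<String, String> logFiles) {
        this.logFiles = new HashMap<>(logFiles);
        this.records.putAll(logFiles); // simulates the initial full scan
    }

    // Point lookup against the shared map.
    Optional<String> getRecordByKey(String key) {
        return Optional.ofNullable(records.get(key));
    }

    // Batch lookup: clears the shared map, then rescans for the given keys.
    // afterClear is a hypothetical hook standing in for another thread that
    // happens to run between the clear and the rescan.
    List<String> getRecordsByKeys(List<String> keys, Runnable afterClear) {
        records.clear();
        afterClear.run();
        List<String> found = new ArrayList<>();
        for (String key : keys) {
            String value = logFiles.get(key);
            if (value != null) {
                records.put(key, value);
                found.add(value);
            }
        }
        return found;
    }
}

class RaceSketch {
    // Replays the problematic interleaving and returns what the point lookup saw.
    static Optional<String> lookupDuringRescan() {
        Map<String, String> logs = new HashMap<>();
        logs.put("k1", "v1");
        logs.put("k2", "v2");
        SketchReader reader = new SketchReader(logs);

        List<Optional<String>> seen = new ArrayList<>();
        // "k1" exists in the logs, but the lookup runs right after the clear.
        reader.getRecordsByKeys(
            Collections.singletonList("k2"),
            () -> seen.add(reader.getRecordByKey("k1")));
        return seen.get(0);
    }

    public static void main(String[] args) {
        // Prints "false": the key exists, yet the lookup came back empty.
        System.out.println(lookupDuringRescan().isPresent());
    }
}
```

In this sketch, guarding both methods with the same lock would keep a point lookup from observing the cleared map mid-rescan. Whether that is the right fix for the actual reader is not stated in this issue.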
--
This message was sent by Atlassian Jira
(v8.20.1#820001)