Prashant Wason created HUDI-3495:
------------------------------------

             Summary: Reading keys in parallel from 
HoodieMetadataMergedLogRecordReader may lead to empty results even if key exists
                 Key: HUDI-3495
                 URL: https://issues.apache.org/jira/browse/HUDI-3495
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Prashant Wason


[HoodieMetadataMergedLogRecordReader|https://github.com/apache/hudi/blob/master/hudi-common/src/main/java/org/apache/hudi/metadata/HoodieMetadataMergedLogRecordReader.java]
 has two functions that look up keys:

getRecordByKey(String key) - looks up the key in the member variable map "records"
getRecordsByKeys(List<String> keys) - clears the member variable map "records" and 
scans the log files again.

If the two functions are called in parallel, getRecordByKey() may return an 
empty result even though the key exists, because the "records" map was cleared 
by another thread calling getRecordsByKeys().
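
One possible way to close the race described above is to make both lookups 
mutually exclusive, so a reader can never observe the "records" map between the 
clear and the rescan. The sketch below is only an illustration under stated 
assumptions: MergedLogReaderSketch, its "logFile" map (a stand-in for scanning 
the log files), sample() and countEmptyLookups() are hypothetical names, not 
Hudi code, and this is not necessarily how the issue should be or was fixed.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;

// Hypothetical stand-in for the reader; NOT the actual Hudi class.
class MergedLogReaderSketch {
    // Assumption: an in-memory map simulates the contents of the log files.
    private final Map<String, String> logFile;
    // Shared state that both lookup functions touch.
    private final Map<String, String> records = new HashMap<>();

    MergedLogReaderSketch(Map<String, String> logFile) {
        this.logFile = new HashMap<>(logFile);
        this.records.putAll(this.logFile);
    }

    // Looks the key up in "records"; holds the same lock as the rescan.
    synchronized Optional<String> getRecordByKey(String key) {
        return Optional.ofNullable(records.get(key));
    }

    // Clears "records", rescans the simulated log, and returns the values
    // found for the requested keys. The lock hides the cleared state.
    synchronized List<String> getRecordsByKeys(List<String> keys) {
        records.clear();
        records.putAll(logFile); // simulated rescan of the log files
        List<String> found = new ArrayList<>();
        for (String key : keys) {
            String value = records.get(key);
            if (value != null) {
                found.add(value);
            }
        }
        return found;
    }

    // Builds a small reader with two known keys.
    static MergedLogReaderSketch sample() {
        Map<String, String> log = new HashMap<>();
        log.put("k1", "v1");
        log.put("k2", "v2");
        return new MergedLogReaderSketch(log);
    }

    // Looks up an existing key while another thread keeps rescanning;
    // returns how many lookups came back empty.
    static int countEmptyLookups(int rounds) {
        MergedLogReaderSketch reader = sample();
        List<String> other = new ArrayList<>();
        other.add("k2");
        Thread rescanner = new Thread(() -> {
            for (int i = 0; i < rounds; i++) {
                reader.getRecordsByKeys(other);
            }
        });
        rescanner.start();
        int empty = 0;
        for (int i = 0; i < rounds; i++) {
            if (!reader.getRecordByKey("k1").isPresent()) {
                empty++;
            }
        }
        try {
            rescanner.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return empty;
    }
}
```

If the synchronized keyword is removed from both methods, getRecordByKey() can 
run between records.clear() and the rescan and miss a key that exists, which is 
the behaviour reported here. A single coarse lock serializes all lookups, so it 
trades some read parallelism for correctness.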

--
This message was sent by Atlassian Jira
(v8.20.1#820001)
