[ https://issues.apache.org/jira/browse/HIVE-25142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HIVE-25142: ---------------------------------- Labels: pull-request-available (was: ) > Rehashing in map join fast hash table causing corruption for large keys > ------------------------------------------------------------------------ > > Key: HIVE-25142 > URL: https://issues.apache.org/jira/browse/HIVE-25142 > Project: Hive > Issue Type: Bug > Components: Hive > Reporter: mahesh kumar behera > Assignee: mahesh kumar behera > Priority: Major > Labels: pull-request-available > Time Spent: 10m > Remaining Estimate: 0h > > In map join the hash table is created using the keys. To support rehashing, > the keys are stored in write buffer. The hash table contains the offset of > the keys along with the hash code. When rehashing is done, the offset is > extracted from the hash table and then hash code is generated again. For > large keys of size greater than 255, the key length is also stored along with > the key. In case of fast hash table implementation the way key is extracted > is not proper. There is a code bug and thats causing the wrong key to be > extracted and causing wrong hash code generation. This is causing the > corruption in the hash table. -- This message was sent by Atlassian Jira (v8.3.4#803005)