[ https://issues.apache.org/jira/browse/HIVE-7617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sergey Shelukhin updated HIVE-7617: ----------------------------------- Attachment: HIVE-7617.03.patch Addressed some of the concerns; removed the size changes for buffers, they are already in some other patch. > optimize bytes mapjoin hash table read path wrt serialization, at least for > common cases > ---------------------------------------------------------------------------------------- > > Key: HIVE-7617 > URL: https://issues.apache.org/jira/browse/HIVE-7617 > Project: Hive > Issue Type: Improvement > Reporter: Sergey Shelukhin > Assignee: Sergey Shelukhin > Attachments: HIVE-7617.01.patch, HIVE-7617.02.patch, > HIVE-7617.03.patch, HIVE-7617.patch, HIVE-7617.prelim.patch, > hashmap-wb-fixes.png > > > BytesBytes has table stores keys in the byte array for compact > representation, however that means that the straightforward implementation of > lookups serializes lookup keys to byte arrays, which is relatively expensive. > We can either shortcut hashcode and compare for common types on read path > (integral types which would cover most of the real-world keys), or specialize > hashtable and from BytesBytes... create LongBytes, StringBytes, or whatever. > First one seems simpler now. -- This message was sent by Atlassian JIRA (v6.2#6252)