Sergey Shelukhin created HIVE-9790:
--------------------------------------

             Summary: Hybrid Hybrid Grace Hash Join: improve side file serialization
                 Key: HIVE-9790
                 URL: https://issues.apache.org/jira/browse/HIVE-9790
             Project: Hive
          Issue Type: Improvement
            Reporter: Sergey Shelukhin

We have discussed this in the past. The current method is very wasteful: it serializes a helper object for each row, so serialization is expensive and a bunch of unneeded data gets written. At the "memory-insert vs side-file-spill" decision point we could instead produce, in one method call, bytes that are directly usable by the hashtable. So we should do that. At the load point there would then be no expensive deserialization and no helpers; the bytes can go into the hashtable pretty much directly.
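
To make the proposal concrete, here is a rough sketch in Java of what "bytes in, bytes out" could look like. Everything in it is an assumption made for illustration: `SideFileSketch`, `BytesTable`, `Partition`, and `encode` are hypothetical names, not Hive classes, the side file is simulated with an in-memory buffer, and the row layout (two `writeUTF` strings) is just a placeholder for whatever layout the real hashtable expects.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch only; none of these names exist in Hive. */
final class SideFileSketch {

    /** Stand-in for the in-memory hashtable: it accepts ready-to-use row bytes. */
    static final class BytesTable {
        final List<byte[]> rows = new ArrayList<>();
        void put(byte[] rowBytes) { rows.add(rowBytes); }
    }

    /** Stand-in for one hash partition that is either in memory or spilled. */
    static final class Partition {
        final BytesTable table = new BytesTable();
        // An in-memory buffer plays the role of the side file in this sketch.
        final ByteArrayOutputStream sideFile = new ByteArrayOutputStream();
        final DataOutputStream out = new DataOutputStream(sideFile);
        boolean spilled;

        /** Decision point: the row has already been encoded exactly once. */
        void add(byte[] rowBytes) throws IOException {
            if (!spilled) {
                table.put(rowBytes);            // memory insert
            } else {
                out.writeInt(rowBytes.length);  // side-file spill: length prefix
                out.write(rowBytes);            // followed by the same raw bytes
            }
        }

        /** Load point: read length-prefixed records and insert them as-is. */
        void reload() throws IOException {
            out.flush();
            DataInputStream in = new DataInputStream(
                    new ByteArrayInputStream(sideFile.toByteArray()));
            while (in.available() > 0) {
                byte[] rowBytes = new byte[in.readInt()];
                in.readFully(rowBytes);
                table.put(rowBytes);            // no helper object, no decoding
            }
            sideFile.reset();
            spilled = false;
        }
    }

    /** Hypothetical one-call encoder producing hashtable-ready bytes. */
    static byte[] encode(String key, String value) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        DataOutputStream d = new DataOutputStream(buf);
        d.writeUTF(key);
        d.writeUTF(value);
        return buf.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        Partition p = new Partition();
        p.add(encode("k1", "v1"));   // goes straight into the table
        p.spilled = true;            // pretend memory pressure forced a spill
        p.add(encode("k2", "v2"));   // goes to the side file as raw bytes
        p.reload();                  // raw bytes come back into the table
        System.out.println("rows in table after reload: " + p.table.rows.size());
    }
}
```

The point of the sketch is that the same byte array is used on both paths, so the spill path costs one length prefix and one write per row, and the reload path is a plain read followed by an insert.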