Sergey Shelukhin created HIVE-9790:
--------------------------------------

             Summary: Hybrid Hybrid Grace Hash Join: improve side file 
serialization
                 Key: HIVE-9790
                 URL: https://issues.apache.org/jira/browse/HIVE-9790
             Project: Hive
          Issue Type: Improvement
            Reporter: Sergey Shelukhin


We have discussed it in the past; the current method is very wasteful, if 
serializes helper object for each row (so there's expensive serialization and 
also bunch of unneeded stuff serialized), whereas at the "memory-insert vs 
side-file-spill" decision point we can produce bytes that are directly usable 
by hashtable in one method call. So we should do that... at load point, again 
there's no expensive deserialization, and no helpers, bytes can go into 
hashtable directly pretty much



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to