[ https://issues.apache.org/jira/browse/HIVE-1738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12923397#action_12923397 ]
Siying Dong commented on HIVE-1738:
-----------------------------------

One note: for the query above, the input format is SequenceFile, which is not friendly to this kind of query. I converted the input to RCFile and ran the same comparison against it; Map's CPU_MILLISECONDS improved from about 1,050,000 to about 965,000.

> Optimize Key Comparison in GroupByOperator
> ------------------------------------------
>
>                 Key: HIVE-1738
>                 URL: https://issues.apache.org/jira/browse/HIVE-1738
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Siying Dong
>            Assignee: Siying Dong
>         Attachments: HIVE.1738.1.patch
>
>
> GroupByOperator uses ObjectInspectorUtils.compare() to compare keys. That method is
> written for generalized object comparisons and is not optimized for the
> group-by operator. By optimizing this logic, we expect to see obvious
> improvements in GroupByOperator.
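
To illustrate the idea in the issue description, here is a minimal Java sketch of the difference between a generalized comparison and one specialized for group-by keys. It is not the attached patch and does not use Hive's API: the class KeyCompareSketch, the ColType enum, and the KeyComparer helper are all hypothetical names invented for this example. The assumption is that the key column types are known once per operator, so the per-field type dispatch can be done up front rather than on every row.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BiPredicate;

// Hypothetical illustration only; none of these names come from Hive.
class KeyCompareSketch {

    // Generalized path: inspects the runtime type of every field on every call.
    static boolean genericEquals(Object[] a, Object[] b) {
        if (a.length != b.length) {
            return false;
        }
        for (int i = 0; i < a.length; i++) {
            Object x = a[i], y = b[i];
            if (x == null || y == null) {
                if (x != y) return false;   // equal only when both are null
                continue;
            }
            if (x instanceof Long && y instanceof Long) {
                if (((Long) x).longValue() != ((Long) y).longValue()) return false;
            } else if (!x.equals(y)) {
                return false;
            }
        }
        return true;
    }

    // Assumed set of key column types for this sketch.
    enum ColType { LONG, STRING }

    // Specialized path: choose one equality check per key column once,
    // then reuse those checks for every row.
    static final class KeyComparer {
        private final List<BiPredicate<Object, Object>> checks = new ArrayList<>();

        KeyComparer(ColType... types) {
            for (ColType t : types) {
                switch (t) {
                    case LONG:
                        checks.add((x, y) -> ((Long) x).longValue() == ((Long) y).longValue());
                        break;
                    case STRING:
                        checks.add((x, y) -> x.equals(y));
                        break;
                }
            }
        }

        boolean sameKey(Object[] a, Object[] b) {
            for (int i = 0; i < checks.size(); i++) {
                Object x = a[i], y = b[i];
                if (x == null || y == null) {
                    if (x != y) return false;   // equal only when both are null
                    continue;
                }
                if (!checks.get(i).test(x, y)) return false;
            }
            return true;
        }
    }

    public static void main(String[] args) {
        KeyComparer comparer = new KeyComparer(ColType.LONG, ColType.STRING);
        Object[] previousKey = {Long.valueOf(1000L), "us"};
        Object[] currentKey = {Long.valueOf(1000L), "us"};
        System.out.println(genericEquals(previousKey, currentKey));
        System.out.println(comparer.sameKey(previousKey, currentKey));
    }
}
```

Both paths give the same answer for the same keys; the specialized one only moves the type decision out of the per-row loop. Whether that yields a measurable gain depends on the input format and workload, as the comment above notes for SequenceFile versus RCFile.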