[ https://issues.apache.org/jira/browse/HIVE-3972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Phabricator updated HIVE-3972: ------------------------------ Attachment: HIVE-3972.D8349.1.patch navis requested code review of "HIVE-3972 [jira] Support using multiple reducer for fetching order by results". Reviewers: JIRA DPAL-1976 Support using multiple reducer for fetching order by results Queries for fetching results which have lastly "order by" clause make final MR run with single reducer, which can be too much. For example, select value, sum(key) as sum from src group by value order by sum; If number of reducer is reasonable, multiple result files could be merged into single sorted stream in the fetcher level. TEST PLAN EMPTY REVISION DETAIL https://reviews.facebook.net/D8349 AFFECTED FILES common/src/java/org/apache/hadoop/hive/conf/HiveConf.java ql/src/java/org/apache/hadoop/hive/ql/exec/FetchOperator.java ql/src/java/org/apache/hadoop/hive/ql/exec/FetchTask.java ql/src/java/org/apache/hadoop/hive/ql/exec/MergeSortingFetcher.java ql/src/java/org/apache/hadoop/hive/ql/exec/RowFetcher.java ql/src/java/org/apache/hadoop/hive/ql/exec/SMBMapJoinOperator.java ql/src/java/org/apache/hadoop/hive/ql/parse/QB.java ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java ql/src/java/org/apache/hadoop/hive/ql/plan/FetchWork.java ql/src/test/queries/clientpositive/orderby_query_bucketing.q ql/src/test/results/clientpositive/orderby_query_bucketing.q.out MANAGE HERALD RULES https://reviews.facebook.net/herald/view/differential/ WHY DID I GET THIS EMAIL? https://reviews.facebook.net/herald/transcript/20325/ To: JIRA, navis > Support using multiple reducer for fetching order by results > ------------------------------------------------------------ > > Key: HIVE-3972 > URL: https://issues.apache.org/jira/browse/HIVE-3972 > Project: Hive > Issue Type: Improvement > Components: Query Processor > Reporter: Navis > Assignee: Navis > Priority: Minor > Attachments: HIVE-3972.D8349.1.patch > > > Queries for fetching results which have lastly "order by" clause make final > MR run with single reducer, which can be too much. For example, > {code} > select value, sum(key) as sum from src group by value order by sum; > {code} > If number of reducer is reasonable, multiple result files could be merged > into single sorted stream in the fetcher level. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira