Navis created HIVE-3290:
---------------------------
Summary: BucketizedHiveInputFormat should support combining files
having same bucket number
Key: HIVE-3290
URL: https://issues.apache.org/jira/browse/HIVE-3290
Project: Hive
Issue Type: Improvement
Components: Query Processor
Affects Versions: 0.10.0
Reporter: Navis
Assignee: Navis
Priority: Minor
Current BucketizedHiveInputFormat creates one split per one input file, which
could result too many map tasks. If input files are not so big (make
configurable threshold?), combining files with same bucket number and same
input format could help reducing total execution time.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira