[ https://issues.apache.org/jira/browse/HIVE-5102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13763796#comment-13763796 ]
Phabricator commented on HIVE-5102: ----------------------------------- ashutoshc has accepted the revision "HIVE-5102 [jira] ORC getSplits should create splits based the stripes". +1 REVISION DETAIL https://reviews.facebook.net/D12849 BRANCH h-5102 ARCANIST PROJECT hive To: JIRA, ashutoshc, omalley > ORC getSplits should create splits based the stripes > ----------------------------------------------------- > > Key: HIVE-5102 > URL: https://issues.apache.org/jira/browse/HIVE-5102 > Project: Hive > Issue Type: Bug > Components: File Formats > Reporter: Owen O'Malley > Assignee: Owen O'Malley > Attachments: HIVE-5102.D12579.1.patch, HIVE-5102.D12579.2.patch, > HIVE-5102.D12849.1.patch > > > Currently ORC inherits getSplits from FileFormat, which basically makes a > split per an HDFS block. This can create too little parallelism and would be > better done by having getSplits look at the file footer and create splits > based on the stripes. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira