[ https://issues.apache.org/jira/browse/HIVE-14269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15388344#comment-15388344 ]
Steve Loughran commented on HIVE-14269:
---------------------------------------

People working on this should be using a build of Hadoop 2.8+, with S3A's latest patches:
# The instrumentation you get at the API level (or just from the input stream and filesystem .toString() values) tells you what happened.
# Things that may appear to be a problem may not be so bad any more.
# Things that are still a problem may be addressable in Hadoop, with patches against that latest code.

> Performance optimizations for data on S3
> ----------------------------------------
>
>                 Key: HIVE-14269
>                 URL: https://issues.apache.org/jira/browse/HIVE-14269
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 2.1.0
>            Reporter: Sergio Peña
>            Assignee: Sergio Peña
>
> Working with tables that reside on Amazon S3 (or any other object store)
> has several performance impacts when reading or writing data, and also
> consistency issues.
> This JIRA is an umbrella task to monitor all the performance improvements
> that can be done in Hive to work better with S3 data.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)