[jira] [Commented] (HIVE-14269) Performance optimizations for data on S3

Steve Loughran (JIRA) Thu, 21 Jul 2016 13:12:17 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-14269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15388344#comment-15388344
 ]


Steve Loughran commented on HIVE-14269:
---------------------------------------

People working on this should be using a build of Hadoop 2.8+, with S3A's 
latest patches

# the instrumentation you get at the API level (or just in input stream and 
filesystem .toString()) values tell you what happened.
# Things that may appear to be a problem may not be so bad any more.
# things that are still be problem may be addressable in Hadoop, with patches 
against that latest code

> Performance optimizations for data on S3
> ----------------------------------------
>
>                 Key: HIVE-14269
>                 URL: https://issues.apache.org/jira/browse/HIVE-14269
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 2.1.0
>            Reporter: Sergio Peña
>            Assignee: Sergio Peña
>
> Working with tables that resides on Amazon S3 (or any other object store) 
> have several performance impact when reading or writing data, and also 
> consistency issues.
> This JIRA is an umbrella task to monitor all the performance improvements 
> that can be done in Hive to work better with S3 data.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-14269) Performance optimizations for data on S3

Reply via email to