[jira] [Commented] (HIVE-15199) INSERT INTO data on S3 is replacing the old rows with the new ones

Steve Loughran (JIRA) Tue, 15 Nov 2016 10:51:06 -0800

    [ 
https://issues.apache.org/jira/browse/HIVE-15199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15667931#comment-15667931
 ]


Steve Loughran commented on HIVE-15199:
---------------------------------------

sounds related to HADOOP-13402

I am not going to express any opinion about what is "the correct" behaviour we 
should expect from rename, as I don't think anyone knows that. If you look at 
the [FS 
Specification|https://hadoop.apache.org/docs/stable2/hadoop-project-dist/hadoop-common/filesystem/filesystem.html]
 we're pretty explicit that rename is hard, and that there are different 
behaviours by different filesystems are.

I'm not defending S3A here, just noting I'm not 100% sure of what HDFS does 
itself here, and how that compares to the semantics of posix's rename call 
(which is different from the unix command line {{mv}} operation).

> INSERT INTO data on S3 is replacing the old rows with the new ones
> ------------------------------------------------------------------
>
>                 Key: HIVE-15199
>                 URL: https://issues.apache.org/jira/browse/HIVE-15199
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>            Reporter: Sergio Peña
>            Assignee: Sergio Peña
>            Priority: Critical
>
> Any INSERT INTO statement run on S3 tables and when the scratch directory is 
> saved on S3 is deleting old rows of the table.
> {noformat}
> hive> set hive.blobstore.use.blobstore.as.scratchdir=true;
> hive> create table t1 (id int, name string) location 's3a://spena-bucket/t1';
> hive> insert into table t1 values (1,'name1');
> hive> select * from t1;
> 1       name1
> hive> insert into table t1 values (2,'name2');
> hive> select * from t1;
> 2       name2
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HIVE-15199) INSERT INTO data on S3 is replacing the old rows with the new ones

Reply via email to