Re: Convert hive table to iceberg table

2019-07-03 Thread Anton Okolnychyi
We started talking about a proper API/tool for migrating existing tables in [1] but I do think we created any issues. I’ll attach the email thread to the created issue. Thanks! [1] - https://lists.apache.org/thread.html/0f55e50fe48c15a55b31277b48e70500c642ea50cf8d6a3f12dfafe4@%3Cdev.iceberg.apa

Re: Convert hive table to iceberg table

2019-07-03 Thread 俊杰陈
Thanks, Ryan OK, I will create one for this and you guys can delete if duplicated. I wanted to run spark-sql-perf to verify basic functionality and performance and found that It can't convert from spark/hive table to iceberg table. On Wed, Jul 3, 2019 at 1:07 AM Ryan Blue wrote: > > Right now, w

Re: Convert hive table to iceberg table

2019-07-02 Thread Ryan Blue
Right now, we don't have a utility to convert tables automatically. I don't think there is an open issue for it (though Anton may have opened one). On Tue, Jul 2, 2019 at 9:43 AM 俊杰陈 wrote: > Thanks for the explanation. > > So to migrate a spark table to iceberg table, I can append the > SparkDa

Re: Convert hive table to iceberg table

2019-07-02 Thread 俊杰陈
Thanks for the explanation. So to migrate a spark table to iceberg table, I can append the SparkDataFiles from listParquetPartition to an iceberg table, right? I don't see an existing open issue for this, is this still wokring in progress? On Mon, Jul 1, 2019 at 7:34 PM Anton Okolnychyi wrote:

Re: Convert hive table to iceberg table

2019-07-01 Thread Anton Okolnychyi
Hey, The issue you pointed out is about tracking Iceberg tables in HMS and levering HMS locks to commit metadata instead of relying on renames. This allows Iceberg to reliably manage metadata when it is persisted in object stores. At the same time, it is possible to migrate Spark tables to Iceb