Re: [VOTE] Release Apache Iceberg 0.10.0 RC4

2020-11-05 Thread Mass Dosage
+1 non-binding on RC4. I tested out the Hive read path on a distributed cluster using HadoopTables.

On Thu, 5 Nov 2020 at 04:46, Dongjoon Hyun wrote:
> +1 for 0.10.0 RC4.
>
> Bests,
> Dongjoon.
>
> On Wed, Nov 4, 2020 at 7:17 PM Jingsong Li wrote:
>
>> +1
>>
>> 1. Download the source tarball, s

About importing Hive tables and name mapping

2020-11-05 Thread 李响
Dear community:

I am using SparkTableUtil to import an existing Hive table into an Iceberg table. The ORC files of the Hive table were written with an old version of ORC, so I set a name mapping (like: id 1 mapped to _col0 and id 2 mapped to _col1...) on the Iceberg table by using "schema.name-mapping.default" so that

Re: [VOTE] Release Apache Iceberg 0.10.0 RC4

2020-11-05 Thread Edgar Rodriguez
+1 non-binding for RC4. Tested with internal tests in cluster, validated Spark write and Hive reads.

On Thu, Nov 5, 2020 at 5:56 AM Mass Dosage wrote:
> +1 non-binding on RC4. I tested out the Hive read path on a distributed
> cluster using HadoopTables.
>
> On Thu, 5 Nov 2020 at 04:46, Dongjoon

Re: About importing Hive tables and name mapping

2020-11-05 Thread Edgar Rodriguez
Hi Xiang,

On Thu, Nov 5, 2020 at 11:07 AM 李响 wrote:
> Dear community:
>
> I am using SparkTableUtil to import an existing Hive table into an Iceberg
> table.
> The ORC files of the Hive table were written with an old version of ORC, so I set a name
> mapping (like: id 1 mapped to _col0 and id 2 mapped to _col1...) t

Re: [VOTE] Release Apache Iceberg 0.10.0 RC4

2020-11-05 Thread Anton Okolnychyi
+1 (binding)

Verified the RC locally.

- Anton

> On 5 Nov 2020, at 08:15, Edgar Rodriguez
> wrote:
>
> +1 non-binding for RC4. Tested with internal tests in cluster, validated
> Spark write and Hive reads.
>
> On Thu, Nov 5, 2020 at 5:56 AM Mass Dosage
> wrote:

Re: About importing Hive tables and name mapping

2020-11-05 Thread Ryan Blue
Edgar is correct. Name mapping is used if a data file has no field ids. When you import data with a name mapping, you should leave it configured on the table so that you can read the data files that you imported. There's no need for a different mapping because we assume that the files you add to t
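To illustrate the point about field ids, here is a minimal sketch in Python. It builds the kind of name mapping described in the question (id 1 mapped to _col0, id 2 mapped to _col1) as a JSON string that could be stored under "schema.name-mapping.default", and then shows the fallback rule: a column uses the field id stored in the data file when there is one, and the name mapping only when there is none. The helper names positional_name_mapping and resolve_field_id are hypothetical and are not part of Iceberg; the JSON layout (a list of entries with "field-id" and "names") is an assumption based on the Iceberg table spec, and the lookup is a simplification, not Iceberg's actual reader logic.

```python
import json


def positional_name_mapping(num_columns):
    """Hypothetical helper: map field id N to the old-style ORC name _col(N-1)."""
    return [
        {"field-id": field_id, "names": [f"_col{field_id - 1}"]}
        for field_id in range(1, num_columns + 1)
    ]


def resolve_field_id(column_name, file_field_id, mapping):
    """Hypothetical helper: simplified illustration of the fallback rule.

    If the data file already carries a field id for the column, use it.
    Otherwise look the column name up in the name mapping.
    """
    if file_field_id is not None:
        return file_field_id
    for entry in mapping:
        if column_name in entry["names"]:
            return entry["field-id"]
    return None  # column is not covered by the mapping


mapping = positional_name_mapping(2)

# JSON string that would be set as the value of "schema.name-mapping.default".
property_value = json.dumps(mapping)
print(property_value)

# An imported file without field ids falls back to the mapping.
print(resolve_field_id("_col1", None, mapping))
```

Because the mapping is only consulted when a file has no field ids, leaving it configured on the table does not change how files written by Iceberg itself are resolved in this sketch.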

Re: [VOTE] Release Apache Iceberg 0.10.0 RC4

2020-11-05 Thread Daniel Weeks
+1 (binding)

Verified all steps.

On Thu, Nov 5, 2020 at 8:31 AM Anton Okolnychyi wrote:
> +1 (binding)
>
> Verified the RC locally.
>
> - Anton
>
> On 5 Nov 2020, at 08:15, Edgar Rodriguez <
> edgar.rodrig...@airbnb.com.INVALID> wrote:
>
> +1 non-binding for RC4. Tested with internal tests in c

Re: [VOTE] Release Apache Iceberg 0.10.0 RC4

2020-11-05 Thread John Zhuge
+1 (non-binding)

Verified all steps.

Just a heads-up in case anyone encounters the same issue: when running unit tests on my laptop, my local custom HADOOP_HOME and HADOOP_CONF_DIR settings somehow caused 30 iceberg-flink unit tests to fail and "TestFlinkIcebergSink" to hang. After unsetti

Re: [VOTE] Release Apache Iceberg 0.10.0 RC4

2020-11-05 Thread Ryan Blue
I'm changing my vote to -1 because I think we should get #1730 fixed. This is a problem with row-level deletes so it doesn't affect v1 tables, but I think we want people to start building on top of 0.10.0 for v2 tables, so I think we should release with