About importing Hive tables and name mapping

李响 Thu, 05 Nov 2020 08:07:41 -0800

Dear community:

I am using SparkTableUtil to import an existing Hive table into an Iceberg
table.
The ORC files of the Hive table are in an old version of ORC, so I set a name
mapping (e.g. id 1 mapped to _col0, id 2 mapped to _col1, ...) on the
Iceberg table through the "schema.name-mapping.default" property, so that the
metrics of the ORC files are built correctly during the import.
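
To make the setup concrete, here is a minimal sketch of how a mapping of this kind could be generated as JSON. The column names "id" and "data" and the helper build_name_mapping are hypothetical, and listing the real column name next to the positional one is an assumption on my side. The output follows the name-mapping JSON layout (a list of objects with "field-id" and "names"), and the resulting string is what would go into the "schema.name-mapping.default" table property.

```python
import json


def build_name_mapping(column_names):
    """Build a name-mapping list for field ids 1..N.

    Each field id is mapped to both the real column name and the
    positional name (_col0, _col1, ...) found in the old ORC files.
    Hypothetical helper, for illustration only.
    """
    return [
        {"field-id": index + 1, "names": [name, f"_col{index}"]}
        for index, name in enumerate(column_names)
    ]


# Hypothetical column names; replace with the real Hive table columns.
mapping = build_name_mapping(["id", "data"])

# JSON string to be stored in the "schema.name-mapping.default" property.
mapping_json = json.dumps(mapping)
print(mapping_json)
```

Keeping the real column names in the list as well is meant to let the same mapping resolve columns in files that carry proper names, but that is a design guess rather than something I have verified.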


After that, I plan to write new data into the Iceberg table (using ORC
version 1.6.5, as bundled in the Iceberg package). How should I deal with the
name mapping that was used for the import? Should I remove it? Does that name
mapping do any harm when reading from or writing to the new ORC files?

I am not sure whether we need a per-data-file name mapping setting here, in
addition to the default name mapping at the table level.



-- 

                                               李响 Xiang Li


e-mail: wate...@gmail.com
