Dear community:

I am using SparkTableUtil to import an existing Hive table to an Iceberg
table.
The ORC files of Hive table is an old version of ORC, so I set a name
mapping (like: id 1 mapped to _col0 and id 2 mapped to _col1...) to the
Iceberg table by using "schema.name-mapping.default" so that the matrics of
ORC files could be built correctly during the import process.

After that, I plan to write new data into the Iceberg table (using the ORC
version 1.6.5 in the iceberg package), how could I deal with that name
mapping used for importing ? Should I remove that? Does that name mapping
do any harm when reading/writing from/to the new ORC file?

I am not sure if we need a per-data file name mapping setting here in
additional to the default name mapping for the whole table level?



-- 

                                               李响 Xiang Li


邮件 e-mail      :wate...@gmail.com

Reply via email to