Re: Positional delete with vs without the delete row values

2022-05-09 Thread Jack Ye
I don't think there is much of a technical obstacle to Trino writing position delete files with row data, because the old rows can be provided as additional channels in the page passed to the update/delete node. The tradeoff is basically CDC capability vs. efficiency and delete file size. My

Re: Positional delete with vs without the delete row values

2022-05-09 Thread Piotr Findeisen
Hi Peter, FWIW, the Trino Iceberg connector writes delete files with just positions, without row data. cc @Alexander Jo > For the 1st point we just need to collect the statistics during the delete, but we do not have to actually persist the data. I would be wary of creating ORC/Parquet files wit

Positional delete with vs without the delete row values

2022-05-05 Thread Peter Vary
Hi Team, We are working on integrating Iceberg V2 tables with Hive and enabling delete and update operations. Delete was implemented by Marton, and the first version is already merged: https://issues.apache.org/jira/browse/HIVE-26102 The upd