Re: Iceberg Transactions via spark

2021-05-04 Thread Ryan Blue
Vivek, you might want to try MERGE INTO again. You should be able to make it more efficient by adding predicates to the ON clause. Those will get pushed down to the target table to avoid a big scan. Iceberg supports transactions to do what you want, but it doesn't use table locking. Instead, it w
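
To illustrate the suggestion about adding predicates to the ON clause, here is a minimal sketch that assembles such a MERGE INTO statement as a string. The table names (db.events, updates), the key column (id), and the date predicate are hypothetical and do not come from the thread; the helper build_merge_sql is likewise made up for illustration. It only shows where the extra predicates would go, not what the original poster's query looked like.

```python
def build_merge_sql(target, source, key, extra_predicates):
    """Build a Spark SQL MERGE INTO statement whose ON clause carries
    extra predicates on the target table (alias t) next to the key match."""
    # Key equality first, then any additional target-side filters.
    on_clause = " AND ".join([f"t.{key} = s.{key}", *extra_predicates])
    return (
        f"MERGE INTO {target} t\n"
        f"USING {source} s\n"
        f"ON {on_clause}\n"
        "WHEN MATCHED THEN UPDATE SET *\n"
        "WHEN NOT MATCHED THEN INSERT *"
    )


# Hypothetical names: a target table partitioned by event_date.
sql = build_merge_sql(
    target="db.events",
    source="updates",
    key="id",
    extra_predicates=["t.event_date >= '2021-05-01'"],
)
print(sql)
```

With a Spark session configured for Iceberg, the resulting string could be passed to spark.sql(sql). Whether the extra predicate actually prunes files depends on how the target table is partitioned, which is an assumption here.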

dump Kafka to type 1 snapshot

2021-05-04 Thread Lian Jiang
Hi, I want to dump events from a Kafka topic into a data lake as a type 1 snapshot in Iceberg. Type 1 means that a record overwrites any previous record with the same key, so each key has only one record in the snapshot. Note that I want to simplify the long path: kafka -> (streaming
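
The type 1 semantics described above (one record per key, latest wins) can be sketched in plain Python, independent of Kafka or Iceberg. The event shape (offset, key, value) and the function name type1_snapshot are assumptions made for this illustration; a real pipeline would apply the same rule through an upsert into the table rather than an in-memory dict.

```python
def type1_snapshot(events):
    """Collapse a stream of (offset, key, value) events into a type 1
    snapshot: exactly one value per key, taken from the highest offset."""
    latest = {}  # key -> (offset, value)
    for offset, key, value in events:
        # Overwrite only when this event is newer than the one we hold.
        if key not in latest or offset > latest[key][0]:
            latest[key] = (offset, value)
    return {key: value for key, (_, value) in latest.items()}


# Hypothetical events: key "a" appears twice, the later one wins.
events = [(0, "a", "v1"), (1, "b", "v2"), (2, "a", "v3")]
snapshot = type1_snapshot(events)
print(snapshot)
```

Comparing offsets rather than relying on arrival order keeps the result stable even if events are processed out of order.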