Just following up on this thread. I was tracking the 1.3.0 release milestone
https://github.com/apache/iceberg/milestone/26 and it seems like we added some
more items today. A few of these PRs look pretty close, but I think the main
blocker is https://github.com/apache/iceberg/issues/7623 . I th
I agree, it totally depends on the way “last modified time” per partition is
implemented.
I’m concerned about the performance of computing partition stats (as well as
storage and the size of table metadata files) if the implementation requires users to keep
around all snapshots. (I described one of my use ca
Thanks Ryan, Russell,
Let me explain the situation a bit further.
We have time series data written to an Iceberg table, and a Flink job uses
this Iceberg table as a source to read the incoming data
continuously.
*Downstream job -> Iceberg table -> Flink job*
The Flink job