Re: Java 1.3.0 around mid May?

2023-05-16 Thread Jahagirdar, Amogh
Just following up on this thread: I was tracking the 1.3.0 release milestone https://github.com/apache/iceberg/milestone/26 and it seems we added some more items today. A few of these PRs look pretty close, but I think the main blocker is https://github.com/apache/iceberg/issues/7623 . I th

RE: [Proposal] Partition stats in Iceberg

2023-05-16 Thread Mayur Srivastava
I agree, it totally depends on the way “last modified time” per partition is implemented. I’m concerned about performance of computing partition stats (and storage + the size of table metadata files) if the implementation requires users to keep around all snapshots. (I described one of my use ca

Re: Scan statistics

2023-05-16 Thread Péter Váry
Thanks Ryan, Russell. Let me explain the situation a bit further. We have time series data written to an Iceberg table, and a Flink job uses this Iceberg table as a source to read the incoming data continuously. *Downstream job -> Iceberg table -> Flink job* The Flink job