[DISCUSS] June board report

2021-06-15 Thread Ryan Blue
Hi everyone, time for another board report. I’m a bit late this month, so we may need to report next month as well. Let me know if there are any updates or comments! Ryan Description: Apache Iceberg is a table format for huge analytic datasets that is designed for high performance and ease of use

Metadata files not cleaned up in Spark

2021-06-15 Thread Peter Giles
I'm exploring the use of Iceberg (0.11.1) in a Spark app, and I'm very enthusiastic about the features, but I am seeing something unexpected in my testing: there is an ever-increasing accumulation of v#.metadata.json files for my test table, and I am not seeing them get cleaned up when using any of

Re: Metadata files not cleaned up in Spark

2021-06-15 Thread Ryan Blue
Peter, Iceberg keeps track of the last few metadata files and cleans them up after it reaches a configurable max, which I think defaults to 100. On Tue, Jun 15, 2021 at 5:58 PM Peter Giles wrote: > I'm exploring the use of Iceberg (0.11.1) in a spark app, and I'm very > enthusiastic about th
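
For readers who want to try the configurable max mentioned above, here is a minimal sketch that builds a Spark SQL statement setting the relevant table properties. It assumes the property names `write.metadata.previous-versions-max` and `write.metadata.delete-after-commit.enabled` from the Iceberg table configuration; to the best of my knowledge the second one is off by default and has to be enabled before old metadata files are removed on commit, but please check this against the docs for the Iceberg version in use. The helper `metadata_cleanup_sql` and the table name `db.sample` are hypothetical and only for illustration.

```python
def metadata_cleanup_sql(table: str, max_previous_versions: int = 100) -> str:
    """Build an ALTER TABLE statement that turns on metadata file cleanup.

    Assumption: the two property names below are taken from the Iceberg
    table configuration; they are not stated in this thread.
    """
    if max_previous_versions < 1:
        raise ValueError("max_previous_versions must be at least 1")

    props = {
        # Assumed to be disabled by default, so cleanup needs it set to true.
        "write.metadata.delete-after-commit.enabled": "true",
        # How many previous metadata files to keep before deleting the oldest.
        "write.metadata.previous-versions-max": str(max_previous_versions),
    }
    assignments = ", ".join(f"'{key}'='{value}'" for key, value in props.items())
    return f"ALTER TABLE {table} SET TBLPROPERTIES ({assignments})"


if __name__ == "__main__":
    # Hypothetical table name; print the statement instead of running it.
    print(metadata_cleanup_sql("db.sample", max_previous_versions=10))
```

With an existing SparkSession (assumed here to be named `spark`), the statement could be run as `spark.sql(metadata_cleanup_sql("db.sample"))`. Building the statement as a string keeps the sketch independent of any Spark or Iceberg installation.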

Re: Iceberg 0.12 release ETA

2021-06-15 Thread Justin Mclean
Hi, > We haven't set a date for the 0.12 release yet, but we're going to be > discussing this in the Iceberg sync tomorrow if you'd like to join. Was the discussion from the sync brought back to the mailing list? Remember not everyone in your community can attend synchronous meetings and may m

Re: Metadata files not cleaned up in Spark

2021-06-15 Thread Vivekanand Vellanki
Ryan, does this also mean that the metadata.json files will only contain information about the last 100 snapshots, thus preventing the metadata.json files from becoming very large? On Wed, Jun 16, 2021 at 6:30 AM Ryan Blue wrote: > Peter, Iceberg keeps track of the last few metadata files and cl