Hi Dan, I proposed the same last year while working on partition stats. I can revive this PR if required, https://github.com/apache/iceberg/pull/8500
But we decided that `*iceberg-data`* can write these parquet stats files (metadata) and core can just register it. So, it is no longer needed for partition stats. a) Do we have any strong use case or feature that requires it now? b) I hope we do the same for ORC as well as it looks odd to have a module for that? - Ajantha On Sat, Dec 7, 2024 at 5:22 AM Daniel Weeks <dwe...@apache.org> wrote: > Everyone, > > I wanted to propose moving the parquet implementation from the > 'iceberg-parquet' project to the 'iceberg-core' project. > > The original motivation for keeping these subprojects separate was due to > Iceberg relying on avro (which is included in the core project) for > metadata and keeping other format implementations separate, but as we > consider adding support for partition stats and parquet metadata, we need > the ability to read and write parquet from core library. > > I've created a draft PR <https://github.com/apache/iceberg/pull/11716> of > the proposed changes, which relocates relatively cleanly, but wanted to > discuss whether there are concerns or other considerations for keeping them > separate. > > -Dan >