Hi Devin,

The schema-id is stored in the header of the manifest Avro file: https://iceberg.apache.org/spec/#manifests
The schema itself is stored there as well. Would that help in your situation? I think this makes adding it to the data file redundant.
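
For illustration, here is a minimal sketch of pulling that value out of a manifest using only the Python standard library. It assumes the manifest is a regular Avro object container file and that the header metadata uses the `schema-id` key described in the spec section linked above. The function names and any path you pass in are hypothetical, and in practice an Avro library or an Iceberg client would normally do this for you.

```python
MAGIC = b"Obj\x01"  # 4-byte magic at the start of an Avro object container file


def _read_long(buf):
    """Decode one Avro long (variable-length, zigzag encoded)."""
    shift = 0
    raw = 0
    while True:
        chunk = buf.read(1)
        if not chunk:
            raise EOFError("truncated Avro header")
        byte = chunk[0]
        raw |= (byte & 0x7F) << shift
        if not byte & 0x80:
            break
        shift += 7
    return (raw >> 1) ^ -(raw & 1)


def _read_bytes(buf):
    """Decode one length-prefixed Avro bytes/string value."""
    length = _read_long(buf)
    data = buf.read(length)
    if len(data) != length:
        raise EOFError("truncated Avro header")
    return data


def read_avro_header_metadata(buf):
    """Return the file-level metadata map (str -> bytes) from an Avro header."""
    if buf.read(4) != MAGIC:
        raise ValueError("not an Avro object container file")
    metadata = {}
    while True:
        count = _read_long(buf)
        if count == 0:  # a zero-count block terminates the map
            break
        if count < 0:  # negative count: a block byte size follows
            count = -count
            _read_long(buf)  # size in bytes, not needed here
        for _ in range(count):
            key = _read_bytes(buf).decode("utf-8")
            metadata[key] = _read_bytes(buf)
    return metadata


def manifest_schema_id(path):
    """Hypothetical helper: read `schema-id` from a local manifest file.

    Assumes the manifest header carries a `schema-id` entry whose value is
    the schema ID written as a string.
    """
    with open(path, "rb") as f:
        metadata = read_avro_header_metadata(f)
    return int(metadata["schema-id"].decode("utf-8"))
```

Every data file listed in a manifest would then be associated with the schema ID found in that manifest's header, which is why a separate per-file field may not be needed.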

Kind regards,
Fokko

On Fri, 14 Feb 2025 at 17:56, Devin Smith <devinsm...@deephaven.io.invalid> wrote:

> I want to make sure I'm not missing something that already exists;
> otherwise, hoping to get a quick thumbs up / thumbs down on a potential
> proposal before spending more time on it.
>
> It would be nice to know what Iceberg schema a writer used (/assumed) when
> writing a DataFile. Oftentimes, this information is written into the
> parquet file's metadata, but it would be great if Iceberg provided this
> directly. A schema_id on DataFile would be nice, I think.
>
> Thanks,
> -Devin