Hey Gaurav,

Orphan files do not affect Iceberg's read performance. Iceberg performs no
list operations: it plans a read from the files tracked in the table
metadata, so files that are not referenced there are never touched,
regardless of whether you filter on the partition column. Orphan files only
increase your storage bill, since they sit around without being relevant
anymore.

Iceberg tables do need periodic maintenance, though. For example, it is good
to rewrite small files into bigger ones
<https://iceberg.apache.org/docs/latest/spark-procedures/#rewrite_data_files>
to avoid many calls to your storage.
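
As a rough sketch of what that maintenance could look like from PySpark,
assuming you already have a Spark session with an Iceberg catalog configured:
the catalog name `my_catalog`, the table name `db.events`, and the helper
function names are placeholders I made up, not anything from your setup. It
calls the `rewrite_data_files` procedure linked above and, since you asked
about orphan files, the `remove_orphan_files` procedure in dry-run mode so
that nothing is deleted yet.

```python
def rewrite_data_files_sql(catalog: str, table: str) -> str:
    """Build the CALL statement that compacts small data files."""
    return f"CALL {catalog}.system.rewrite_data_files(table => '{table}')"


def remove_orphan_files_sql(catalog: str, table: str, dry_run: bool = True) -> str:
    """Build the CALL statement that finds (and optionally deletes) orphan files."""
    flag = "true" if dry_run else "false"
    return (
        f"CALL {catalog}.system.remove_orphan_files("
        f"table => '{table}', dry_run => {flag})"
    )


def run_maintenance(spark, catalog: str, table: str, dry_run: bool = True) -> None:
    """Run both procedures on an existing SparkSession (assumed to be Iceberg-enabled)."""
    spark.sql(rewrite_data_files_sql(catalog, table)).show()
    spark.sql(remove_orphan_files_sql(catalog, table, dry_run)).show(truncate=False)


# Example with placeholder names; `spark` would be your own session:
# run_maintenance(spark, "my_catalog", "db.events", dry_run=True)
```

With `dry_run` left at `True`, the second call should only list the files it
would remove, which is a safe way to check the result before deleting anything.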

I hope this answers your question.

Kind regards,
Fokko



On Tue, 23 May 2023 at 21:13, Gaurav Agarwal <gaurav130...@gmail.com> wrote:

> Hello
>
> We have orphan files in the table. Does this impact the read performance of
> the table if we pass the partition column in the read query?
>
> We would like to know what the impact of orphan files is.
>
> Thanks
>