[ https://issues.apache.org/jira/browse/ARROW-5258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-5258: ------------------------------ External issue URL: https://github.com/apache/arrow/issues/21729 > [C++/Python] Expose file metadata of dataset pieces to caller > ------------------------------------------------------------- > > Key: ARROW-5258 > URL: https://issues.apache.org/jira/browse/ARROW-5258 > Project: Apache Arrow > Issue Type: Improvement > Components: C++, Python > Reporter: Pearu Peterson > Assignee: Pearu Peterson > Priority: Major > Labels: parquet, pull-request-available > Fix For: 0.14.0 > > Time Spent: 1h > Remaining Estimate: 0h > > This issue resolves partly the issue raised in ARROW-1983 by providing a way > to collect FileMetaData objects of dataset pieces created via > `write_to_dataset` function: > {noformat} > metadata_list = [] > write_to_dataset(table, ..., metadata_collector=metadata_list) > {noformat} > where `metadata_list` will contain instances of file metadata objects of > dataset pieces. -- This message was sent by Atlassian Jira (v8.20.10#820010)