[ https://issues.apache.org/jira/browse/ARROW-18060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David Li updated ARROW-18060:
-----------------------------
    Issue Type: Bug  (was: Improvement)

> [C++] Writing a dataset with 0 rows doesn't create any files
> ------------------------------------------------------------
>
>                 Key: ARROW-18060
>                 URL: https://issues.apache.org/jira/browse/ARROW-18060
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>    Affects Versions: 9.0.0
>            Reporter: David Li
>            Priority: Major
>
> If the input data has no rows, no files get created. This is potentially
> unexpected as it looks like "nothing happened". It might be nicer to create
> an empty file. With partitioning, though, that then gets weird (there's no
> partition values) so maybe an error might make more sense instead.
> Reproduction in Python
> {code:python}
> import tempfile
> from pathlib import Path
> import pyarrow
> import pyarrow.dataset
> print("PyArrow version:", pyarrow.__version__)
> table = pyarrow.table([
>     [],
> ], schema=pyarrow.schema([
>     ("ints", "int64"),
> ]))
> with tempfile.TemporaryDirectory() as d:
>     pyarrow.dataset.write_dataset(table, d, format="feather")
>     print(list(Path(d).iterdir()))
> {code}
> Output
> {noformat}
> > python repro.py
> PyArrow version: 9.0.0
> []
> {noformat}