Mike Pedersen created ARROW-4836: ------------------------------------ Summary: "Cannot tell() a compressed stream" when using RecordBatchStreamWriter Key: ARROW-4836 URL: https://issues.apache.org/jira/browse/ARROW-4836 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 0.12.1 Reporter: Mike Pedersen
It does not seem like RecordBatchStreamWriter works with compressed streams: {code:python} >>> import pyarrow as pa >>> pa.__version__ '0.12.1' >>> stream = pa.output_stream('/tmp/a.gz') >>> batch = pa.RecordBatch.from_arrays([pa.array([1])], ['a']) >>> writer = pa.RecordBatchStreamWriter(stream, batch.schema) >>> writer.write(batch) Traceback (most recent call last): File "<stdin>", line 1, in <module> File "pyarrow/ipc.pxi", line 181, in pyarrow.lib._RecordBatchWriter.write File "pyarrow/ipc.pxi", line 196, in pyarrow.lib._RecordBatchWriter.write_batch File "pyarrow/error.pxi", line 89, in pyarrow.lib.check_status pyarrow.lib.ArrowNotImplementedError: Cannot tell() a compressed stream {code} As I understand the documentation, this should be possible, right? -- This message was sent by Atlassian JIRA (v7.6.3#76005)