Mike Pedersen created ARROW-4836:
------------------------------------
Summary: "Cannot tell() a compressed stream" when using RecordBatchStreamWriter
Key: ARROW-4836
URL: https://issues.apache.org/jira/browse/ARROW-4836
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.12.1
Reporter: Mike Pedersen
RecordBatchStreamWriter does not seem to work with compressed streams:
{code:python}
>>> import pyarrow as pa
>>> pa.__version__
'0.12.1'
>>> stream = pa.output_stream('/tmp/a.gz')
>>> batch = pa.RecordBatch.from_arrays([pa.array([1])], ['a'])
>>> writer = pa.RecordBatchStreamWriter(stream, batch.schema)
>>> writer.write(batch)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "pyarrow/ipc.pxi", line 181, in pyarrow.lib._RecordBatchWriter.write
File "pyarrow/ipc.pxi", line 196, in pyarrow.lib._RecordBatchWriter.write_batch
File "pyarrow/error.pxi", line 89, in pyarrow.lib.check_status
pyarrow.lib.ArrowNotImplementedError: Cannot tell() a compressed stream
{code}
As I understand the documentation, this should be possible, right?