Can you show the code you are writing? The first thing the stream writer
does before writing any record batch is write the schema. It sounds like
you are using arrow::ipc::WriteRecordBatch somewhere.
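For reference, here is roughly the shape the writing code should take. This is
only a minimal sketch against the Result-based C++ API of Arrow 0.17+; the
exact names (e.g. arrow::ipc::NewFileWriter / NewStreamWriter) may differ
between versions, so adjust to whatever your build exposes:

#include <arrow/api.h>
#include <arrow/io/file.h>
#include <arrow/ipc/writer.h>

// Sketch: write one RecordBatch to an IPC file on disk.
arrow::Status WriteBatchToFile(const std::shared_ptr<arrow::RecordBatch>& batch,
                               const std::string& path) {
  ARROW_ASSIGN_OR_RAISE(auto sink, arrow::io::FileOutputStream::Open(path));
  // NewFileWriter emits the IPC *file* format, which is what
  // pa.ipc.open_file reads; NewStreamWriter emits the *stream* format
  // (read with pa.ipc.open_stream). Either writer writes the schema
  // before any record batches.
  ARROW_ASSIGN_OR_RAISE(auto writer,
                        arrow::ipc::NewFileWriter(sink.get(), batch->schema()));
  ARROW_RETURN_NOT_OK(writer->WriteRecordBatch(*batch));
  ARROW_RETURN_NOT_OK(writer->Close());
  return sink->Close();
}

If your code instead calls the low-level arrow::ipc::WriteRecordBatch directly
on the output stream, only the batch body gets written, which would match the
file size you are seeing.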

On Sun, Jun 14, 2020, 11:44 PM Rares Vernica <rvern...@gmail.com> wrote:

> Hello,
>
> I have a RecordBatch that I would like to write to a file. I'm using
> FileOutputStream::Open to open the file and RecordBatchStreamWriter::Open
> to open the stream. I write a record batch with WriteRecordBatch. Finally,
> I close the RecordBatchWriter and OutputStream.
>
> The resulting file size is exactly the size of the Buffer used to store the
> RecordBatch. It looks like it is missing the schema. When I try to open the
> resulting file from PyArrow I get:
>
> >>> pa.ipc.open_file('/tmp/1')
> pyarrow.lib.ArrowInvalid: File is too small: 6
>
> $ ll /tmp/1
> -rw-r--r--. 1 root root 720 Jun 15 03:54 /tmp/1
>
> How can I write the schema as well?
>
> I was browsing the documentation at
> https://arrow.apache.org/docs/cpp/index.html but I can't locate any C++
> documentation about RecordBatchStreamWriter or RecordBatchWriter. Is this
> intentional?
>
> Thank you!
> Rares
>
