Re: C++ Migrate from Arrow 0.16.0

Antoine Pitrou Thu, 03 Jun 2021 00:44:29 -0700


It should work as long as the union itself doesn't have nulls:
https://github.com/apache/arrow/blob/master/cpp/src/arrow/ipc/reader.cc#L351

Regards

Antoine.


Le 03/06/2021 à 06:42, Micah Kornfield a écrit :

I think the one place where it might break is for Union types (I seem to
recall a breaking change just prior to 1.0).

On Wed, Jun 2, 2021 at 1:00 PM Antoine Pitrou <anto...@python.org> wrote:


Le 02/06/2021 à 21:57, Rares Vernica a écrit :

Thanks for the pointers! The migration is going well.

We have been using Arrow 0.16.0 RecordBatchStreamWriter
<

https://github.com/Paradigm4/bridge/blob/master/src/PhysicalXSave.cpp#L450


with & without CompressedOutputStream and wrote the resulting Arrow

Buffer

data to S3
<https://github.com/Paradigm4/bridge/blob/master/src/S3Driver.cpp#L168>

or file

system
<https://github.com/Paradigm4/bridge/blob/master/src/FSDriver.cpp#L156>.

We

have a sizable amount of data saved this way.

Once we upgrade our C++ code to use Arrow 3.0.0 or 4.0.0, will it be
possible to read the Arrow steam files written with Arrow 0.16.0?


It definitely should, unless it's a bug.
That said, for extra safety, I suggest you test loading the files before
doing the final migration.

By the way, for saving lots of data to S3, it may be more efficient to
use Parquet. It will be more CPU-intensive but will result in
significant space savings.

Regards

Antoine.

Re: C++ Migrate from Arrow 0.16.0

Reply via email to