Hi all,

Currently, in arrow-java, when VectorUnloader compresses an
ArrowRecordBatch, it compresses each ArrowBuf of a FieldVector
separately instead of compressing at the FieldVector level. From a
compression ratio perspective, larger inputs generally compress
better. Additionally, calling
compress(BufferAllocator allocator, ArrowBuf uncompressedBuffer) once per
buffer may consume more CPU than calling it once per vector.

Therefore, I would like to ask whether there are plans to support
compression at the FieldVector level, which could improve the
compression ratio without affecting the ability to read individual
columns.
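
To make the ratio argument concrete, here is a minimal sketch that
compares the two granularities. It does not use Arrow at all: it
uses java.util.zip.Deflater from the JDK as a stand-in codec, plain
byte arrays as stand-ins for the buffers of one vector, and synthetic
data chosen so that the buffers share content. The class and method
names are hypothetical, and real gains will depend on the data and on
the codec in use.

```java
import java.io.ByteArrayOutputStream;
import java.util.zip.Deflater;

class CompressionGranularityDemo {

    // Compress one byte array with DEFLATE (zlib framing) and return the compressed size.
    static int compressedSize(byte[] input) {
        Deflater deflater = new Deflater();
        try {
            deflater.setInput(input);
            deflater.finish();
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] chunk = new byte[256];
            while (!deflater.finished()) {
                int n = deflater.deflate(chunk);
                out.write(chunk, 0, n);
            }
            return out.size();
        } finally {
            deflater.end(); // release native resources
        }
    }

    // One compress call per buffer: the stand-in for per-buffer compression.
    static int compressedSizeSeparately(byte[][] buffers) {
        int total = 0;
        for (byte[] buffer : buffers) {
            total += compressedSize(buffer);
        }
        return total;
    }

    // One compress call over all buffers concatenated: the stand-in for
    // compressing at the vector level.
    static int compressedSizeCombined(byte[][] buffers) {
        ByteArrayOutputStream all = new ByteArrayOutputStream();
        for (byte[] buffer : buffers) {
            all.write(buffer, 0, buffer.length);
        }
        return compressedSize(all.toByteArray());
    }

    // Synthetic data (an assumption for illustration): 8 small buffers with
    // identical content, so cross-buffer redundancy is only visible to the
    // combined call.
    static byte[][] sampleBuffers() {
        byte[][] buffers = new byte[8][64];
        for (byte[] buffer : buffers) {
            for (int i = 0; i < buffer.length; i++) {
                buffer[i] = (byte) i;
            }
        }
        return buffers;
    }

    public static void main(String[] args) {
        byte[][] buffers = sampleBuffers();
        System.out.println("separate: " + compressedSizeSeparately(buffers) + " bytes");
        System.out.println("combined: " + compressedSizeCombined(buffers) + " bytes");
    }
}
```

In this sketch the separate path also pays the per-stream framing
overhead once per buffer, which is part of why many small buffers
compress poorly on their own.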

Many thanks,
Yunhong Zheng
