[ https://issues.apache.org/jira/browse/ARROW-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17662400#comment-17662400 ]
Rok Mihevc commented on ARROW-5377:
-----------------------------------

This issue has been migrated to [issue #21836|https://github.com/apache/arrow/issues/21836] on GitHub. Please see the [migration documentation|https://github.com/apache/arrow/issues/14542] for further details.

> [C++] Make IpcPayload public and add GetPayloadSize
> ---------------------------------------------------
>
> Key: ARROW-5377
> URL: https://issues.apache.org/jira/browse/ARROW-5377
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Wes McKinney
> Assignee: David Li
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.0.0
>
> Time Spent: 2.5h
> Remaining Estimate: 0h
>
> As discussed in recent mailing list thread
> https://lists.apache.org/thread.html/b756209052fecb8c28a5eb37db7aecb82a5f5351fa79a9d86f0dba3e@%3Cuser.arrow.apache.org%3E
> The only viable process at the moment for getting an accurate report of
> stream size is to write a simulated stream using {{MockOutputStream}}. This
> is suboptimal for a couple of reasons:
> * Flatbuffers metadata must be created twice
> * Record batch disassembly into IpcPayload must be performed twice
> It seems like an interface with a very constrained public API could be
> provided to deconstruct a sequence of RecordBatches and report the size of
> the produced IPC stream (based on metadata sizes, and padding), and then this
> deconstructed set of IPC payloads can be written out to a stream (e.g. using
> {{FixedSizeBufferWriter}})

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
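
The idea in the quoted description (decompose each batch once, then compute the stream size from metadata sizes and padding before writing anything) can be illustrated with a minimal, self-contained sketch. Everything here is an assumption for illustration: {{PayloadSketch}}, {{PadTo}}, {{PayloadSizeSketch}} and {{StreamSizeSketch}} are hypothetical names, not Arrow's API, and the framing (an 8-byte prefix per message, 8-byte alignment for metadata and for each body buffer) is a simplification that ignores the schema message, dictionaries and the end-of-stream marker.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for an already-decomposed record batch.
// This is NOT arrow::ipc::IpcPayload; it only records the lengths
// that a size computation would need.
struct PayloadSketch {
  int64_t metadata_length = 0;               // bytes of serialized metadata
  std::vector<int64_t> body_buffer_lengths;  // unpadded length of each body buffer
};

// Round n up to the next multiple of `alignment` (a power of two).
inline int64_t PadTo(int64_t n, int64_t alignment = 8) {
  assert(alignment > 0 && (alignment & (alignment - 1)) == 0);
  return (n + alignment - 1) & ~(alignment - 1);
}

// Size of one framed message under the assumed layout:
// [8-byte prefix][metadata, padded to 8][each body buffer, padded to 8].
inline int64_t PayloadSizeSketch(const PayloadSketch& payload) {
  constexpr int64_t kPrefixBytes = 8;  // assumed fixed-size message prefix
  int64_t size = PadTo(kPrefixBytes + payload.metadata_length);
  for (int64_t length : payload.body_buffer_lengths) {
    size += PadTo(length);
  }
  return size;
}

// Total size of a sequence of payloads. No bytes are written and nothing
// is serialized a second time; only the recorded lengths are summed.
inline int64_t StreamSizeSketch(const std::vector<PayloadSketch>& payloads) {
  int64_t total = 0;
  for (const PayloadSketch& payload : payloads) {
    total += PayloadSizeSketch(payload);
  }
  return total;
}
```

With the sizes known up front, a caller could allocate a buffer of exactly {{StreamSizeSketch(payloads)}} bytes and then write the same payload objects into it, which is the reuse the description asks for.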