[ 
https://issues.apache.org/jira/browse/ARROW-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15690537#comment-15690537
 ] 

Wes McKinney commented on ARROW-384:
------------------------------------

I talked this proposal through with Julien yesterday and we should be able to 
manage without additional metadata by treating the contiguous buffer block 
written as part of the file format as a "memory page" for the purposes of 
record batch reconstruction -- so we will not consider the offset from the 
beginning of the physical file to be semantically meaningful. This is distinct 
from the shared memory IPC case where the buffers may occur in many different 
places in one or more shared memory regions. 

I'll write up some details under format/ about this more precisely in the 
course of implementing this in my patch for ARROW-363. 

> Align Java and C++ RecordBatch data and metadata layout
> -------------------------------------------------------
>
>                 Key: ARROW-384
>                 URL: https://issues.apache.org/jira/browse/ARROW-384
>             Project: Apache Arrow
>          Issue Type: Bug
>            Reporter: Julien Le Dem
>
> layout on C++ side:
> {noformat}
> <buffers> <metadata> <metadata size: int32>
> {noformat}
> and on the java side:
> {noformat}
> <metadata> <buffers>
> {noformat}
> In the file format the footer has a Block info that contains the metadata 
> length.
> https://github.com/apache/arrow/blob/f082b17323354dc2af31f39c15c58b995ba08360/format/File.fbs#L36
> See:
> https://github.com/apache/arrow/pull/211#issuecomment-262080545



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to