[ 
https://issues.apache.org/jira/browse/ARROW-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15420729#comment-15420729
 ] 

Wes McKinney commented on ARROW-257:
------------------------------------

So if I understand correctly, support we had a union of 50 types, but only 5 of 
them actually occur in the data, then the typeIds would indicate the indices of 
the observed child types. That makes sense to me.

> Add a typeids Vector to Union type
> ----------------------------------
>
>                 Key: ARROW-257
>                 URL: https://issues.apache.org/jira/browse/ARROW-257
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Format
>            Reporter: Julien Le Dem
>
> {noformat}
> enum UnionMode:int { Sparse, Dense }
> table Union {
>   mode: UnionMode;
>   typeIds: [Int32]; // optional, describes typeid of each child.
> }
> {noformat}
> The idea is to enable providing an id different from the child offset (the 
> default)
> This enables an optimization where we use predefined ids when constructing 
> the type vector of the union but want the children to be only the actually 
> used types.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to