[jira] [Commented] (FLINK-10929) Add support for Apache Arrow

Kurt Young (JIRA) Fri, 12 Apr 2019 08:17:19 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-10929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16816345#comment-16816345
 ]


Kurt Young commented on FLINK-10929:
------------------------------------

I'm not sure everyone who have already involved to this discussion have a clean 
and common goal about introducing Apache Arrow to Flink. As Stephan said, there 
are two scenarios which can be considered. 

Regarding (2): I think making Arrow as a vectorized execution data format will 
involves lots of changes, from runtime to operator and query optimizer. We 
should at first have consensus about the final goal and status of this. Whether 
streaming can benefits from vectorized execution? Will this break the 
unification of streaming and batch? How many benefits we can gain from it... 
There are lots of unanswered questions. 

> Add support for Apache Arrow
> ----------------------------
>
>                 Key: FLINK-10929
>                 URL: https://issues.apache.org/jira/browse/FLINK-10929
>             Project: Flink
>          Issue Type: Wish
>          Components: Runtime / State Backends
>            Reporter: Pedro Cardoso Silva
>            Priority: Minor
>         Attachments: image-2019-04-10-13-43-08-107.png
>
>
> Investigate the possibility of adding support for Apache Arrow as a 
> standardized columnar, memory format for data.
> Given the activity that [https://github.com/apache/arrow] is currently 
> getting and its claims objective of providing a zero-copy, standardized data 
> format across platforms, I think it makes sense for Flink to look into 
> supporting it.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (FLINK-10929) Add support for Apache Arrow

Reply via email to