Cross-implementation metadata specification, IPC details

2016-02-29 Thread Wes McKinney
hello all, I wanted to kick-start the process of coming up with a standardized / canonical metadata specification that we can use for describing Arrow data to be moved between systems. This breaks down into at least two distinct kinds of metadata 1) "Schemas": physical types, logical types, child

Re: Arrow examples

2016-02-29 Thread Wes McKinney
hi Dmitriy, I created the following JIRA https://issues.apache.org/jira/browse/SPARK-13534 related to PySpark which seems relevant. I would be happy to collaborate with you on this. Since I understand that the Spark developers are exploring an in-memory columnar layout for Spark DataFrames/Dataset