hello all,
I wanted to kick-start the process of coming up with a standardized /
canonical metadata specification that we can use for describing Arrow
data to be moved between systems. This breaks down into at least two
distinct kinds of metadata
1) "Schemas": physical types, logical types, child
hi Dmitriy,
I created the following JIRA
https://issues.apache.org/jira/browse/SPARK-13534 related to PySpark
which seems relevant. I would be happy to collaborate with you on
this. Since I understand that the Spark developers are exploring an
in-memory columnar layout for Spark DataFrames/Dataset