[jira] [Created] (ARROW-1974) PyArrow segfaults when working with Arrow tables with duplicate columns

2018-01-06 Thread Alexey Strokach (JIRA)
Alexey Strokach created ARROW-1974: Summary: PyArrow segfaults when working with Arrow tables with duplicate columns Key: ARROW-1974 URL: https://issues.apache.org/jira/browse/ARROW-1974 Project: Ap

[jira] [Created] (ARROW-1973) Memory leak when converting Arrow tables with array columns to Pandas dataframes.

2018-01-06 Thread Alexey Strokach (JIRA)
Alexey Strokach created ARROW-1973: Summary: Memory leak when converting Arrow tables with array columns to Pandas dataframes. Key: ARROW-1973 URL: https://issues.apache.org/jira/browse/ARROW-1973 P

[jira] [Created] (ARROW-1972) Deserialization of buffer objects (and pandas dataframes) segfaults on different processes.

2018-01-06 Thread Robert Nishihara (JIRA)
Robert Nishihara created ARROW-1972: Summary: Deserialization of buffer objects (and pandas dataframes) segfaults on different processes. Key: ARROW-1972 URL: https://issues.apache.org/jira/browse/ARROW-1972

[jira] [Created] (ARROW-1971) Add pandas serialization to the default

2018-01-06 Thread Devin Petersohn (JIRA)
Devin Petersohn created ARROW-1971: Summary: Add pandas serialization to the default Key: ARROW-1971 URL: https://issues.apache.org/jira/browse/ARROW-1971 Project: Apache Arrow Issue Type:

How to get "standard" binary columns out of a pyarrow table

2018-01-06 Thread Eli
Hi, I'm looking to send "regular" columnar binary data to a database, the kind that gets created by struct.pack, array.array, numpy.tobytes or str.encode. The origin is parquet files, which I'm reading ever so comfortably via PyArrow. I do however need to deserialize to Python objects, currentl