Jovann Kung created ARROW-1599: ---------------------------------- Summary: PyArrow unable to read Parquet files with vector as column Key: ARROW-1599 URL: https://issues.apache.org/jira/browse/ARROW-1599 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 0.7.0 Environment: Ubuntu Reporter: Jovann Kung Priority: Critical
Is PyArrow currently unable to read in Parquet files with a vector as a column? For example, the schema of such a file is below: {{<pyarrow._parquet.ParquetSchema object at 0x7f2d42493c88> mbc: FLOAT deltae: FLOAT labels: FLOAT features.type: INT32 INT_8 features.size: INT32 features.indices.list.element: INT32 features.values.list.element: DOUBLE}} Using either pq.read_table() or pq.ParquetDataset('/path/to/parquet').read() yields the following error: ArrowNotImplementedError: Currently only nesting with Lists is supported. >From the error I assume that this may be implemented in further releases? -- This message was sent by Atlassian JIRA (v6.4.14#64029)