Rob DiCiuccio created ARROW-7855:
------------------------------------

             Summary: TypeError on mixed array values
                 Key: ARROW-7855
                 URL: https://issues.apache.org/jira/browse/ARROW-7855
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.15.1, 0.16.0
            Reporter: Rob DiCiuccio

The following data structure passed to `pa.array` raises a generic `TypeError`:

{code:java}
import pyarrow as pa

pa.array([{'TestKey': [123456, 'foo']}])
{code}

{code:java}
Traceback (most recent call last):
  File "pyarrow_list_test.py", line 30, in <module>
    pa_array = pa.array([{'TestKey': [123456, 'foo']}])
  File "pyarrow/array.pxi", line 269, in pyarrow.lib.array
  File "pyarrow/array.pxi", line 38, in pyarrow.lib._sequence_to_array
TypeError: an integer is required (got type str)
{code}

I understand there may be a way to work around this by passing an explicit `type` argument to `pa.array`, but the use case here is storing the results of a SQL query where the structure/type of the column is not known in advance.

If Arrow is ultimately unable to handle this data structure without a predefined `type` passed to `pa.array`, can the exception at least use the PyArrow namespace (e.g. `pa.lib.ArrowTypeError` or `pa.lib.ArrowNotImplementedError`)?

Any other workaround suggestions are welcome.
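
For illustration, one possible stopgap for the unknown-schema case is to coerce mixed-type lists to strings before handing the data to `pa.array`. This is only a sketch under assumptions: the helper name `normalize_mixed_lists` is hypothetical and not part of PyArrow, and converting every element to `str` loses the original types, which may not be acceptable for every query result.

```python
def normalize_mixed_lists(value):
    """Recursively stringify lists whose non-None elements have more than one Python type.

    Hypothetical helper for illustration only; it is not a PyArrow API.
    """
    if isinstance(value, dict):
        return {key: normalize_mixed_lists(item) for key, item in value.items()}
    if isinstance(value, list):
        items = [normalize_mixed_lists(item) for item in value]
        kinds = {type(item) for item in items if item is not None}
        if len(kinds) > 1:
            # Mixed element types: fall back to strings, keeping None as null.
            return [None if item is None else str(item) for item in items]
        return items
    return value


rows = [{'TestKey': [123456, 'foo']}]
cleaned = normalize_mixed_lists(rows)

try:
    import pyarrow as pa
except ImportError:
    pa = None  # The normalization step itself does not need PyArrow.

if pa is not None:
    # With homogeneous string lists, type inference no longer sees int and str together.
    pa_array = pa.array(cleaned)
```

Note that the check compares exact Python types, so a list mixing `int` and `float` would also be stringified even though Arrow might otherwise be able to represent it.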