Rob DiCiuccio created ARROW-7855:
------------------------------------

             Summary: TypeError on mixed array values
                 Key: ARROW-7855
                 URL: https://issues.apache.org/jira/browse/ARROW-7855
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.15.1, 0.16.0
            Reporter: Rob DiCiuccio


The following data structure passed to `pa.array` raises a generic `TypeError`:
{code:java}
import pyarrow as pa
pa.array([{'TestKey': [123456, 'foo']}])
{code}
{code:java}
Traceback (most recent call last):
 File "pyarrow_list_test.py", line 30, in <module>
 pa_array = pa.array([\{'TestKey': [123456, 'foo']}])
 File "pyarrow/array.pxi", line 269, in pyarrow.lib.array
 File "pyarrow/array.pxi", line 38, in pyarrow.lib._sequence_to_array
TypeError: an integer is required (got type str)
{code}

I understand there may be a way to overcome this by setting the `type` value as 
an argument to `pa.array`, but the use case here is storing results of a SQL 
query where the structure/type of the column is unknown.

If Arrow is ultimately unable to handle this data structure without a 
predefined `type` passed to `pa.array`, can the exception at least us the 
PyArrow namespace (e.g. `pa.lib.ArrowTypeError` or 
`pa.lib.ArrowNotImplementedError).

Any other workaround suggestions welcome.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to