Wes McKinney created ARROW-8004:
-----------------------------------

             Summary: [Python] Define API for user-defined conversions of array cell values in pyarrow.array
                 Key: ARROW-8004
                 URL: https://issues.apache.org/jira/browse/ARROW-8004
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Wes McKinney

Consider the statement

{code}
pyarrow.array([v0, v1, v2, v3])
{code}

or correspondingly

{code}
pyarrow.array(pd.Series([v0, v1, v2, v3], dtype=object))
{code}

where {{v0, ..., v3}} are instances of types with no built-in conversion-to-Arrow support in pyarrow. An API could be provided to allow user-defined unboxing to a data type that the library _does_ understand (like a NumPy array).

One complexity is that if the unboxing is costly, we may need to "keep around" the unboxed value when doing multiple passes over the data (e.g. initially for type inference and then for conversion), so that each value is unboxed only once.
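
To make the idea concrete, here is a rough sketch of one shape such an API might take. Everything in it is hypothetical: {{register_unboxer}}, {{unbox_all}}, the {{Point}} type and the registry are illustrative names, not existing pyarrow functions, and the example unboxes to plain dicts rather than NumPy arrays to stay dependency-free. The point it illustrates is the "keep around" concern: each value is unboxed once, and both passes read from the stored results.

```python
# Hypothetical sketch: none of these names exist in pyarrow.
_UNBOXERS = {}  # maps a Python type to a function returning a simpler value


def register_unboxer(py_type, func):
    """Register func as the conversion for instances of py_type."""
    _UNBOXERS[py_type] = func


def unbox_all(values):
    """Unbox each value exactly once and return the results as a list.

    The returned list plays the role of the "kept around" unboxed data:
    a type-inference pass and a conversion pass can both iterate over it
    without calling the (possibly costly) unboxer again.
    """
    unboxed = []
    for value in values:
        # Exact-type lookup only; subclasses are not handled in this sketch.
        func = _UNBOXERS.get(type(value))
        # Values with no registered unboxer are passed through unchanged.
        unboxed.append(func(value) if func is not None else value)
    return unboxed


class Point:
    """Example user type with no built-in Arrow conversion."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


calls = []  # records each unboxer invocation, to show it runs once per value


def unbox_point(p):
    calls.append(p)
    return {"x": p.x, "y": p.y}


register_unboxer(Point, unbox_point)

points = [Point(1, 2), Point(3, 4)]
unboxed = unbox_all(points)

# Pass 1 (stand-in for type inference): collect the field names seen.
fields = sorted({key for row in unboxed for key in row})
# Pass 2 (stand-in for conversion): build one column per field.
columns = {name: [row[name] for row in unboxed] for name in fields}
```

With pyarrow installed, the stored {{unboxed}} list could then be handed to {{pyarrow.array(unboxed)}}, which should infer a struct type from the dicts. Whether the cache belongs in user code like this or inside the library's conversion path is exactly the open design question.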