Bryan Cutler created ARROW-7723:
-----------------------------------

             Summary: [Python] StructArray timestamp type with timezone to_pandas convert error
                 Key: ARROW-7723
                 URL: https://issues.apache.org/jira/browse/ARROW-7723
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
            Reporter: Bryan Cutler

When a {{StructArray}} has a child that is a timestamp with a timezone, the {{to_pandas}} conversion outputs an int64 instead of a timestamp:

{code:java}
In [1]: import pyarrow as pa
   ...: import pandas as pd
   ...: arr = pa.array([{'start': pd.Timestamp.now(), 'end': pd.Timestamp.now()}])
   ...:

In [2]: arr.to_pandas()
Out[2]:
0    {'end': 2020-01-29 11:38:02.792681, 'start': 2...
dtype: object

In [3]: ts = pd.Timestamp.now()

In [4]: arr2 = pa.array([ts], type=pa.timestamp('us', tz='America/New_York'))

In [5]: arr2.to_pandas()
Out[5]:
0   2020-01-29 06:38:47.848944-05:00
dtype: datetime64[ns, America/New_York]

In [6]: arr = pa.StructArray.from_arrays([arr2, arr2], ['start', 'stop'])

In [7]: arr.to_pandas()
Out[7]:
0    {'start': 1580297927848944000, 'stop': 1580297...
dtype: object
{code}

from https://github.com/apache/arrow/pull/6312

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
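
For reference, the int64 shown in Out[7] appears to be the child timestamp expressed as nanoseconds since the Unix epoch in UTC, with the timezone dropped. The sketch below uses only the Python standard library (not pyarrow) to decode that value and compare it with Out[5]. The helper name decode_epoch_ns is hypothetical, and the fixed -05:00 offset stands in for America/New_York on that date; both are assumptions made for illustration.

```python
from datetime import datetime, timedelta, timezone

# Fixed offset assumed to match America/New_York on 2020-01-29 (no DST),
# as shown by the "-05:00" suffix in Out[5].
ASSUMED_NY_OFFSET = timezone(timedelta(hours=-5))


def decode_epoch_ns(value_ns, tz):
    """Interpret an int64 as nanoseconds since the Unix epoch (UTC).

    Hypothetical helper: integer arithmetic is used so that the
    sub-second part is not lost to float rounding. Precision below
    one microsecond is truncated, because datetime cannot hold it.
    """
    seconds, nanos = divmod(value_ns, 1_000_000_000)
    utc = datetime.fromtimestamp(seconds, tz=timezone.utc)
    utc += timedelta(microseconds=nanos // 1000)
    return utc.astimezone(tz)


# The 'start' value from Out[7] in the report.
raw_start = 1580297927848944000
decoded = decode_epoch_ns(raw_start, ASSUMED_NY_OFFSET)
print(decoded.isoformat())
```

If the decoded value agrees with Out[5], the struct conversion is returning the underlying epoch value unchanged rather than a wrong instant, so only the timestamp and timezone wrapping is missing.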