Christopher Brooks created ARROW-3030:
-----------------------------------------
Summary: from_pandas() does not respect dtype==category
Key: ARROW-3030
URL: https://issues.apache.org/jira/browse/ARROW-3030
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.10.0
Reporter: Christopher Brooks

While troubleshooting ARROW-2966 I updated my pandas DataFrame with more type information. Specifically, I changed some mixed-type columns from object to categorical.

I assumed that Table.from_pandas() would inspect the pandas type information and respect it when converting the DataFrame to a table. It doesn't seem to. For instance, I expected this code to work, but it throws the same ArrowTypeError as ARROW-2966.

{code:python}
import pandas as pd
import pyarrow as pa

# One column mixing integers and an empty string, stored as a categorical
df = pd.DataFrame.from_dict({"col": [0, 1, 2, 3, ""]}, dtype="category")

# Expected to succeed; throws ArrowTypeError on 0.10.0
tb = pa.Table.from_pandas(df, columns=["col"])
{code}