[ https://issues.apache.org/jira/browse/ARROW-369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Uwe L. Korn updated ARROW-369: ------------------------------ Description: Instead of only being able to only convert single single record batches and tables that consist only of single ColumnChunks, we should also support the construction of Pandas DataFrames from multiple RecordBatches. In the most simple way, we would convert each batch to a Pandas DataFrame and then concat them all together. A second (and preferred) implementation would extend the C++ function {{ConvertColumnToPandas}} in {{python/src/pyarrow/adapters/pandas.*}} to work on chunked columns. (was: We can currently convert single record batches, and tables (which have chunked columns). ) > [Python] Add ability to convert multiple record batches at once to pandas > ------------------------------------------------------------------------- > > Key: ARROW-369 > URL: https://issues.apache.org/jira/browse/ARROW-369 > Project: Apache Arrow > Issue Type: New Feature > Components: Python > Reporter: Wes McKinney > Labels: newbie > > Instead of only being able to only convert single single record batches and > tables that consist only of single ColumnChunks, we should also support the > construction of Pandas DataFrames from multiple RecordBatches. In the most > simple way, we would convert each batch to a Pandas DataFrame and then concat > them all together. A second (and preferred) implementation would extend the > C++ function {{ConvertColumnToPandas}} in > {{python/src/pyarrow/adapters/pandas.*}} to work on chunked columns. -- This message was sent by Atlassian JIRA (v6.3.4#6332)