[ https://issues.apache.org/jira/browse/ARROW-362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-362: ----------------------------- External issue URL: https://github.com/apache/arrow/issues/15907 > Python: Calling to_pandas on a table read from Parquet leaks memory > ------------------------------------------------------------------- > > Key: ARROW-362 > URL: https://issues.apache.org/jira/browse/ARROW-362 > Project: Apache Arrow > Issue Type: Bug > Components: Python > Affects Versions: 0.1.0 > Reporter: Uwe Korn > Assignee: Wes McKinney > Priority: Major > Fix For: 0.2.0 > > > Steps to reproduce: > * Read a parquet file with {{pyarrow.parquet.read_table}} and convert the > table to a DataFrame with {{to_pandas}} > * Repeat this several times and see an ever increasing memory usage > This seems to happen only in this combination. Calling {{gc.collect}} doesn't > help. -- This message was sent by Atlassian Jira (v8.20.10#820010)