[ https://issues.apache.org/jira/browse/ARROW-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-3928:
------------------------------
    External issue URL: https://github.com/apache/arrow/issues/20538

> [Python] Add option to deduplicate PyBytes / PyString / PyUnicode objects in
> Table.to_pandas conversion path
> ------------------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-3928
>                 URL: https://issues.apache.org/jira/browse/ARROW-3928
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Wes McKinney
>            Assignee: Wes McKinney
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.12.0
>
>          Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> While hashing carries a performance penalty, the memory savings can be huge.
> See also ARROW-3911 -- we should develop some reusable machinery for
> conversions that yield Python objects
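
The trade-off described in the issue (hashing cost versus memory savings) can be illustrated with a small pure-Python sketch. This is not the Arrow implementation: the function name `to_objects`, its `deduplicate` flag, and the helper `count_distinct` are hypothetical and exist only for illustration. The idea is to keep a hash table keyed on the raw value, so that each distinct value is materialized as a Python object only once and repeated values reuse it.

```python
def to_objects(values, deduplicate=True):
    """Decode UTF-8 byte strings into Python str objects.

    Hypothetical helper, not part of pyarrow. With deduplicate=True,
    equal inputs share a single str object through a dict memo.
    """
    if not deduplicate:
        # One freshly allocated str per element, even for repeated values.
        return [v.decode("utf-8") for v in values]

    memo = {}  # raw bytes -> already-created str object
    out = []
    for v in values:
        obj = memo.get(v)  # the hash lookup is the performance penalty
        if obj is None:
            obj = v.decode("utf-8")
            memo[v] = obj
        out.append(obj)  # repeated values reuse the same object
    return out


def count_distinct(objs):
    """Count distinct object identities among objects that are all alive."""
    return len({id(o) for o in objs})


raw = [b"apache-arrow", b"pandas", b"apache-arrow"] * 1000
plain = to_objects(raw, deduplicate=False)
deduped = to_objects(raw, deduplicate=True)

print(count_distinct(plain), count_distinct(deduped))
```

Both calls return equal lists, but the deduplicated result holds only as many string objects as there are distinct values, which is where the memory savings come from when a column has many repeats.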