[ https://issues.apache.org/jira/browse/ARROW-4582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17661605#comment-17661605 ]
Rok Mihevc commented on ARROW-4582:
-----------------------------------

This issue has been migrated to [issue #21126|https://github.com/apache/arrow/issues/21126] on GitHub. Please see the [migration documentation|https://github.com/apache/arrow/issues/14542] for further details.

> [C++/Python] Memory corruption on Pandas->Arrow conversion
> ----------------------------------------------------------
>
>                 Key: ARROW-4582
>                 URL: https://issues.apache.org/jira/browse/ARROW-4582
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, Python
>    Affects Versions: 0.11.0, 0.11.1, 0.12.0
>            Reporter: Uwe Korn
>            Assignee: Uwe Korn
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.12.1, 0.13.0
>
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> When converting DataFrames with numerical columns to Arrow tables we were
> seeing random segfaults in core Python code. This only happened in
> environments where we had a high level of parallelisation or slow code
> execution (e.g. in AddressSanitizer builds).
> The reason for these segfaults was that we were incrementing the reference
> count of the underlying NumPy buffer but were not holding the GIL while
> changing the reference count.
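
The race described in the issue can be illustrated with a toy model in pure Python. This is only a sketch under stated assumptions and not Arrow's actual fix: `ToyBuffer`, `toy_gil`, `incref_with_gil`, and `run_workers` are hypothetical names, a plain integer stands in for the NumPy buffer's reference count, and a `threading.Lock` stands in for the GIL. What it aims to show is that the count stays consistent when every increment happens while the lock is held.

```python
import threading


class ToyBuffer:
    """Hypothetical stand-in for a NumPy buffer with a non-atomic refcount."""

    def __init__(self):
        self.refcount = 1  # one owner at creation time


# Hypothetical stand-in for the GIL: one lock guarding all refcount changes.
toy_gil = threading.Lock()


def incref_with_gil(buf, times):
    """Increment the toy refcount `times` times, holding the lock each time."""
    for _ in range(times):
        with toy_gil:
            # The read-modify-write below is only safe because the lock is held.
            buf.refcount += 1


def run_workers(buf, n_threads, times):
    """Run several threads that all take references to the same buffer."""
    threads = [
        threading.Thread(target=incref_with_gil, args=(buf, times))
        for _ in range(n_threads)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return buf.refcount


buf = ToyBuffer()
final_count = run_workers(buf, n_threads=8, times=10_000)
```

With the lock held around each increment, `final_count` equals the initial count plus one per increment. In the real C++ conversion path the analogous step would presumably be to acquire the GIL before touching the reference count, for example through the CPython C API pair `PyGILState_Ensure()` and `PyGILState_Release()`.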