Xavier Lacroze created ARROW-7367:
-------------------------------------

             Summary: Use np.full instead of np.array.repeat in ParquetDatasetPiece
                 Key: ARROW-7367
                 URL: https://issues.apache.org/jira/browse/ARROW-7367
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Xavier Lacroze

For small tables (len < 100), execution time is slightly worse (about x1.4 at len = 10). For large tables the performance gain is substantial (execution time about x0.04 at len = 100_000).
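
To make the comparison concrete, here is a minimal sketch of the two ways of building a constant array of a given length. It is not the code from ParquetDatasetPiece: the helper names `with_repeat` and `with_full`, the `int32` dtype, and the sample value are assumptions chosen for illustration, and timings will vary by machine and NumPy version.

```python
import timeit

import numpy as np


def with_repeat(value, n):
    # Hypothetical helper: build a one-element array, then repeat it n times.
    return np.array([value], dtype="i4").repeat(n)


def with_full(value, n):
    # Hypothetical helper: allocate the array and fill it in one call.
    return np.full(n, value, dtype="i4")


if __name__ == "__main__":
    for n in (10, 100_000):
        # Both approaches should produce identical arrays.
        assert np.array_equal(with_repeat(3, n), with_full(3, n))

        t_repeat = timeit.timeit(lambda: with_repeat(3, n), number=1000)
        t_full = timeit.timeit(lambda: with_full(3, n), number=1000)
        print(f"len={n}: full/repeat time ratio = {t_full / t_repeat:.2f}")
```

A ratio above 1 means np.full was slower for that length, and a ratio below 1 means it was faster.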