[jira] [Created] (ARROW-1030) Python: Account for library versioning in parquet-cpp

2017-05-14 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-1030: -- Summary: Python: Account for library versioning in parquet-cpp Key: ARROW-1030 URL: https://issues.apache.org/jira/browse/ARROW-1030 Project: Apache Arrow Issue

[jira] [Created] (ARROW-1029) [Python] Add pyarrow.parquet tests to Appveyor build for Windows

2017-05-14 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1029: --- Summary: [Python] Add pyarrow.parquet tests to Appveyor build for Windows Key: ARROW-1029 URL: https://issues.apache.org/jira/browse/ARROW-1029 Project: Apache Arrow

Re: Making Arrow 0.3.1 minor release

2017-05-14 Thread Wes McKinney
OK, I've thought a bit more about it and I'm +1 on doing a 0.4.0. There's enough meaningful new features and improvements since 0.3.0 already (almost 50 patches) to merit a small announcement blog. I will plan to cut the RC on Tuesday to give enough time to get in some last minute cleanup patches.

[jira] [Created] (ARROW-1028) [Python] Documentation updates after ARROW-1008

2017-05-14 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1028: --- Summary: [Python] Documentation updates after ARROW-1008 Key: ARROW-1028 URL: https://issues.apache.org/jira/browse/ARROW-1028 Project: Apache Arrow Issue Type

Re: Improve SparkR collect performance with Arrow

2017-05-14 Thread Wes McKinney
Note I just opened https://github.com/wesm/feather/pull/297 which deletes all of the Feather Python code (using pyarrow as a dependency). On Sun, May 14, 2017 at 2:44 PM, Wes McKinney wrote: > hi Dean, > > In Arrow 0.3 we incorporated the C++ and Python code from wesm/feather > into the Arrow re

Re: Improve SparkR collect performance with Arrow

2017-05-14 Thread Wes McKinney
hi Dean, In Arrow 0.3 we incorporated the C++ and Python code from wesm/feather into the Arrow repo. The Feather format is a simplified version of the Arrow IPC format (which has file/batch and stream flavors), so the ideal approach would be to move the Feather R/Rcpp wrapper code into the Arrow c

Improve SparkR collect performance with Arrow

2017-05-14 Thread Dean Chen
Following up on the discussion from https://issues.apache.org/jira/browse/SPARK-18924. We have internal use cases that would benefit significantly from improved collect performance and would like to kick off a similar proposal/effort to https://issues.apache.org/jira/browse/SPARK-13534 for SparkR.

[jira] [Created] (ARROW-1027) Allow negative indexing in fields/columns on pyarrow Table and Schema objects

2017-05-14 Thread Phillip Cloud (JIRA)
Phillip Cloud created ARROW-1027: Summary: Allow negative indexing in fields/columns on pyarrow Table and Schema objects Key: ARROW-1027 URL: https://issues.apache.org/jira/browse/ARROW-1027 Project:

[jira] [Created] (ARROW-1026) [Python] Start some C++ tests to verify reference counts in returned Python objects

2017-05-14 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1026: --- Summary: [Python] Start some C++ tests to verify reference counts in returned Python objects Key: ARROW-1026 URL: https://issues.apache.org/jira/browse/ARROW-1026 Proje

Re: Making Arrow 0.3.1 minor release

2017-05-14 Thread Uwe L. Korn
Hello, I would also be in favour of a bugfix release. Although, as we had quite some API changes in https://github.com/apache/arrow/pull/680, I would vote for a 0.4.0 release. Uwe -- Uwe L. Korn uw...@xhochy.com On Sun, May 14, 2017, at 06:02 PM, Wes McKinney wrote: > hi folks, > > I just

[jira] [Created] (ARROW-1025) [Website] Improve changelog on website

2017-05-14 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1025: --- Summary: [Website] Improve changelog on website Key: ARROW-1025 URL: https://issues.apache.org/jira/browse/ARROW-1025 Project: Apache Arrow Issue Type: Improve

Making Arrow 0.3.1 minor release

2017-05-14 Thread Wes McKinney
hi folks, I just fixed a quite serious memory leak in the Arrow Python bindings: https://github.com/apache/arrow/pull/685 There have been some other bugs fixed since 0.3.0, and since I don't think we have any API changes, I think we could release master as is. We could also do a major version bu

[jira] [Created] (ARROW-1024) Python: Update build time numpy version to 1.10.1

2017-05-14 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-1024: -- Summary: Python: Update build time numpy version to 1.10.1 Key: ARROW-1024 URL: https://issues.apache.org/jira/browse/ARROW-1024 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-1023) Python: Fix bundling of arrow-cpp for macOS

2017-05-14 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-1023: -- Summary: Python: Fix bundling of arrow-cpp for macOS Key: ARROW-1023 URL: https://issues.apache.org/jira/browse/ARROW-1023 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-1022) [Python] Add nthreads option to Feather read method

2017-05-14 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1022: --- Summary: [Python] Add nthreads option to Feather read method Key: ARROW-1022 URL: https://issues.apache.org/jira/browse/ARROW-1022 Project: Apache Arrow Issue