Re: R arrow first steps

2017-07-27 Thread Clark Fitzgerald
> would you be open to working within an R/ subdirectory in the Arrow codebase?
Sure, I'll do whatever is most convenient for the team. A branch sounds fine. Here's one from my end: https://github.com/clarkfitzg/arrow/tree/R/R Thanks for the pointers and encouragement. On Thu, Jul 27, 2017 at 5:

Re: [DISCUSS] The road from Arrow 0.5.0 to 1.0.0

2017-07-27 Thread Leif Walsh
I think Wes' idea that major versions indicate stability of the spec and minor versions indicate stability of each implementation's API makes sense. With that in mind, maybe before 1.0 of the spec we should just establish, within each of the reference language implementations, a mechanism for speci

[jira] [Created] (ARROW-1290) [C++] Use array capacity doubling in arrow::BufferBuilder

2017-07-27 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1290: --- Summary: [C++] Use array capacity doubling in arrow::BufferBuilder Key: ARROW-1290 URL: https://issues.apache.org/jira/browse/ARROW-1290 Project: Apache Arrow

Re: R arrow first steps

2017-07-27 Thread Wes McKinney
hi Clark, Cool! Before you go too far down the rabbit hole, would you be open to working within an R/ subdirectory in the Arrow codebase? It doesn't have to be ready-to-ship software, and we are happy to set up a branch in the repository for you to experiment so you don't have to worry about bothe

R arrow first steps

2017-07-27 Thread Clark Fitzgerald
I've got at least a "hello world" for R / Arrow bindings in progress. https://github.com/clarkfitzg/Rarrow Over the next couple weeks I plan to spend some time looking at the Arrow C++ and Python sources and write a few bindings by hand, then think about how to automatically generate bindings from

[jira] [Created] (ARROW-1289) [Python] Add PYARROW_BUILD_PLASMA option like Parquet

2017-07-27 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1289: --- Summary: [Python] Add PYARROW_BUILD_PLASMA option like Parquet Key: ARROW-1289 URL: https://issues.apache.org/jira/browse/ARROW-1289 Project: Apache Arrow Issu

Re: [DISCUSS] The road from Arrow 0.5.0 to 1.0.0

2017-07-27 Thread Julian Hyde
Semantic versioning is a great tool, and we should use it as far as it goes, but not push it. I suggest that the Arrow specification should have a paragraph that states the level of maturity of each part of the API; and each implementation should have a paragraph that states which parts of the spe

[jira] [Created] (ARROW-1288) Clean up many ASF license headers

2017-07-27 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1288: --- Summary: Clean up many ASF license headers Key: ARROW-1288 URL: https://issues.apache.org/jira/browse/ARROW-1288 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-1287) [Python] Emulate "whence" argument of seek in NativeFile

2017-07-27 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1287: --- Summary: [Python] Emulate "whence" argument of seek in NativeFile Key: ARROW-1287 URL: https://issues.apache.org/jira/browse/ARROW-1287 Project: Apache Arrow I

[jira] [Created] (ARROW-1286) PYTHON: support Categorical serialization to/from parquet

2017-07-27 Thread Jeff Reback (JIRA)
Jeff Reback created ARROW-1286: -- Summary: PYTHON: support Categorical serialization to/from parquet Key: ARROW-1286 URL: https://issues.apache.org/jira/browse/ARROW-1286 Project: Apache Arrow Is

[jira] [Created] (ARROW-1285) NotImplemented exception creates empty parquet file

2017-07-27 Thread Jeff Reback (JIRA)
Jeff Reback created ARROW-1285: -- Summary: NotImplemented exception creates empty parquet file Key: ARROW-1285 URL: https://issues.apache.org/jira/browse/ARROW-1285 Project: Apache Arrow Issue Ty