[jira] [Created] (ARROW-5570) Update Avro C++ code to conform to Arrow style guide and get it compiling.

2019-06-11 Thread Micah Kornfield (JIRA)
Micah Kornfield created ARROW-5570: -- Summary: Update Avro C++ code to conform to Arrow style guide and get it compiling. Key: ARROW-5570 URL: https://issues.apache.org/jira/browse/ARROW-5570 Project:

[jira] [Created] (ARROW-5569) import avro C++ code to code base.

2019-06-11 Thread Micah Kornfield (JIRA)
Micah Kornfield created ARROW-5569: -- Summary: import avro C++ code to code base. Key: ARROW-5569 URL: https://issues.apache.org/jira/browse/ARROW-5569 Project: Apache Arrow Issue Type: Sub-t

Re: [VOTE] Formalizing "Extension Type" metadata in Arrow binary protocol

2019-06-11 Thread Ravindra Pindikura
+1 On Tue, Jun 11, 2019 at 10:24 PM Micah Kornfield wrote: > +1 (non-binding) > > On Tue, Jun 11, 2019 at 6:08 AM Antoine Pitrou wrote: > > > > > Le 10/06/2019 à 22:28, Wes McKinney a écrit : > > > > > > Please vote to accept these changes (see [3] for the actual changes). > > > The vote will b

[jira] [Created] (ARROW-5568) [Python] Allow parsing more general JSON formats

2019-06-11 Thread Dave Hirschfeld (JIRA)
Dave Hirschfeld created ARROW-5568: -- Summary: [Python] Allow parsing more general JSON formats Key: ARROW-5568 URL: https://issues.apache.org/jira/browse/ARROW-5568 Project: Apache Arrow Iss

Re: Avro to Arrow?

2019-06-11 Thread Micah Kornfield
Hi Tim, The avro support in C++ has been on my backlog for a while. I'm going to try to take the first few steps towards this over the next couple of days. Let me know if you want to collaborate on it. C++ is a lot nicer now then it was 8 years ago :) Cheers, Micah On Tue, Jun 11, 2019 at 6:40

[jira] [Created] (ARROW-5567) [C++] Fix build error of memory-benchmark

2019-06-11 Thread Yuqi Gu (JIRA)
Yuqi Gu created ARROW-5567: -- Summary: [C++] Fix build error of memory-benchmark Key: ARROW-5567 URL: https://issues.apache.org/jira/browse/ARROW-5567 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-5566) [Python] Overhaul type unification from Python sequence in arrow::py::InferArrowType

2019-06-11 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5566: --- Summary: [Python] Overhaul type unification from Python sequence in arrow::py::InferArrowType Key: ARROW-5566 URL: https://issues.apache.org/jira/browse/ARROW-5566 Proj

[jira] [Created] (ARROW-5565) [Python] Document how to use gdb when working on pyarrow

2019-06-11 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5565: --- Summary: [Python] Document how to use gdb when working on pyarrow Key: ARROW-5565 URL: https://issues.apache.org/jira/browse/ARROW-5565 Project: Apache Arrow I

[jira] [Created] (ARROW-5564) [C++] Add uriparser to conda-forge

2019-06-11 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5564: --- Summary: [C++] Add uriparser to conda-forge Key: ARROW-5564 URL: https://issues.apache.org/jira/browse/ARROW-5564 Project: Apache Arrow Issue Type: Improvement

Re: Avro to Arrow?

2019-06-11 Thread Tim Swast
Thanks for the advice, Wes. Unfortunately, I am about 8 years out of practice for writing any C++ (which was part of the appeal of numba to me). Sounds like I should refresh my skills. I like the idea of write one, have good performance everywhere. On Tue, Jun 11, 2019 at 3:40 PM Wes McKinney wr

[jira] [Created] (ARROW-5563) [Format] Update integration test JSON format documentation in Metadata.rst

2019-06-11 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5563: --- Summary: [Format] Update integration test JSON format documentation in Metadata.rst Key: ARROW-5563 URL: https://issues.apache.org/jira/browse/ARROW-5563 Project: Apach

[GitHub] [arrow-site] wesm merged pull request #6: Remove very stale docs/latest

2019-06-11 Thread GitBox
wesm merged pull request #6: Remove very stale docs/latest URL: https://github.com/apache/arrow-site/pull/6 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [arrow-site] wesm commented on issue #6: Remove very stale docs/latest

2019-06-11 Thread GitBox
wesm commented on issue #6: Remove very stale docs/latest URL: https://github.com/apache/arrow-site/pull/6#issuecomment-501073903 +1 This is an automated message from the Apache Git Service. To respond to the message, please l

[jira] [Created] (ARROW-5562) pyarrow parquet writer does not handle negative zero correctly

2019-06-11 Thread Bob Briody (JIRA)
Bob Briody created ARROW-5562: - Summary: pyarrow parquet writer does not handle negative zero correctly Key: ARROW-5562 URL: https://issues.apache.org/jira/browse/ARROW-5562 Project: Apache Arrow

Re: Avro to Arrow?

2019-06-11 Thread Wes McKinney
Hi Tim, I'd ideally like to see the work done in the Arrow C++ library so that it can be utilized by all the C++ "binders" (Python, R, C, Ruby, MATLAB). This also means a larger labor pool of individuals to help improve and maintain the software. There was a stalled PR around this a time back (che

[GitHub] [arrow-site] nealrichardson opened a new pull request #6: Remove very stale docs/latest

2019-06-11 Thread GitBox
nealrichardson opened a new pull request #6: Remove very stale docs/latest URL: https://github.com/apache/arrow-site/pull/6 See discussion on https://issues.apache.org/jira/browse/ARROW-5548 This is an automated message from t

[jira] [Created] (ARROW-5561) [Release] Build and publish Rust docs

2019-06-11 Thread Chao Sun (JIRA)
Chao Sun created ARROW-5561: --- Summary: [Release] Build and publish Rust docs Key: ARROW-5561 URL: https://issues.apache.org/jira/browse/ARROW-5561 Project: Apache Arrow Issue Type: Improvement

Avro to Arrow?

2019-06-11 Thread Tim Swast
Hi Arrow and Avro devs, I've been investigating some performance issues with the BigQuery Storage API (https://github.com/googleapis/google-cloud-python/issues/7805), and have identified that the vast majority of time is spent decoding Avro into pandas dataframes.

[jira] [Created] (ARROW-5560) Cannot create Plasma object after OutOfMemory error

2019-06-11 Thread Stephanie Wang (JIRA)
Stephanie Wang created ARROW-5560: - Summary: Cannot create Plasma object after OutOfMemory error Key: ARROW-5560 URL: https://issues.apache.org/jira/browse/ARROW-5560 Project: Apache Arrow Is

[jira] [Created] (ARROW-5559) [C++] Introduce IpcOptions struct object for better API-stability when adding new options

2019-06-11 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5559: --- Summary: [C++] Introduce IpcOptions struct object for better API-stability when adding new options Key: ARROW-5559 URL: https://issues.apache.org/jira/browse/ARROW-5559

[jira] [Created] (ARROW-5558) [C++] Support Array::View on arrays with non-zero offsets

2019-06-11 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5558: --- Summary: [C++] Support Array::View on arrays with non-zero offsets Key: ARROW-5558 URL: https://issues.apache.org/jira/browse/ARROW-5558 Project: Apache Arrow

[jira] [Created] (ARROW-5557) [C++] Investigate performance of VisitBitsUnrolled on different platforms

2019-06-11 Thread Rylan Dmello (JIRA)
Rylan Dmello created ARROW-5557: --- Summary: [C++] Investigate performance of VisitBitsUnrolled on different platforms Key: ARROW-5557 URL: https://issues.apache.org/jira/browse/ARROW-5557 Project: Apache

[jira] [Created] (ARROW-5556) [Doc] Document JSON reader

2019-06-11 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-5556: - Summary: [Doc] Document JSON reader Key: ARROW-5556 URL: https://issues.apache.org/jira/browse/ARROW-5556 Project: Apache Arrow Issue Type: Improvement

Arrow sync call tomorrow (June 12) at 12:00 US/Eastern, 16:00 UTC

2019-06-11 Thread Neal Richardson
Hi everyone, Reminder that the biweekly Arrow call is tomorrow at https://meet.google.com/vtm-teks-phx. All are welcome to join. Given the current discussions on the list, including around timing and content of upcoming releases, there's lots to chat about. Notes will be sent out to the mailing li

[jira] [Created] (ARROW-5555) [R] install_arrow()

2019-06-11 Thread Neal Richardson (JIRA)
Neal Richardson created ARROW-: -- Summary: [R] install_arrow() Key: ARROW- URL: https://issues.apache.org/jira/browse/ARROW- Project: Apache Arrow Issue Type: Improvement

Re: [VOTE] Formalizing "Extension Type" metadata in Arrow binary protocol

2019-06-11 Thread Micah Kornfield
+1 (non-binding) On Tue, Jun 11, 2019 at 6:08 AM Antoine Pitrou wrote: > > Le 10/06/2019 à 22:28, Wes McKinney a écrit : > > > > Please vote to accept these changes (see [3] for the actual changes). > > The vote will be open for at least 72 hours > > > > [ ] +1: Adopt these changes into the Arro

Re: [DISCUSS] 32- and 64-bit decimal types

2019-06-11 Thread Ravindra Pindikura
On Tue, Jun 11, 2019 at 2:48 AM Wes McKinney wrote: > On the 1.0.0 protocol discussion, one item that we've skirted for some > time is other decimal sizes: > > https://issues.apache.org/jira/browse/ARROW-2009 > > I understand this is a loaded subject since a deliberate decision was > made to remo

[jira] [Created] (ARROW-5554) Add a python wrapper for arrow::Concatenate

2019-06-11 Thread Zhuo Peng (JIRA)
Zhuo Peng created ARROW-5554: Summary: Add a python wrapper for arrow::Concatenate Key: ARROW-5554 URL: https://issues.apache.org/jira/browse/ARROW-5554 Project: Apache Arrow Issue Type: Improvem

Re: [DISCUSS] 32- and 64-bit decimal types

2019-06-11 Thread Antoine Pitrou
Le 10/06/2019 à 23:24, Wes McKinney a écrit : > > BTW, even if we do not allow 32/64 bit decimals in the format, we > should consider adding a bitWidth field with static value 128 as a > matter of future-proofing the metadata. This change would make it so > that old readers are unable to see the

[jira] [Created] (ARROW-5553) red-arrow gem does not compile on ruby:2.5 docker image

2019-06-11 Thread Sean Dilda (JIRA)
Sean Dilda created ARROW-5553: - Summary: red-arrow gem does not compile on ruby:2.5 docker image Key: ARROW-5553 URL: https://issues.apache.org/jira/browse/ARROW-5553 Project: Apache Arrow Issue

Re: Propose custom_metadata for Footer

2019-06-11 Thread John Muehlhausen
Yes On Tue, Jun 11, 2019 at 9:16 AM Wes McKinney wrote: > Sounds reasonable. John, would you like to create a C++ implementation > of this? It would be helpful for a vote to have an initial patch. > > On Mon, Jun 10, 2019 at 12:24 AM Micah Kornfield > wrote: > > > > Sorry for the late reply. I

Re: Propose custom_metadata for Footer

2019-06-11 Thread Wes McKinney
Sounds reasonable. John, would you like to create a C++ implementation of this? It would be helpful for a vote to have an initial patch. On Mon, Jun 10, 2019 at 12:24 AM Micah Kornfield wrote: > > Sorry for the late reply. I think it sounds reasonable to have custom > metadata in the footer as w

Re: [VOTE] Formalizing "Extension Type" metadata in Arrow binary protocol

2019-06-11 Thread Antoine Pitrou
Le 10/06/2019 à 22:28, Wes McKinney a écrit : > > Please vote to accept these changes (see [3] for the actual changes). > The vote will be open for at least 72 hours > > [ ] +1: Adopt these changes into the Arrow columnar format specification > [ ] +0: . . . > [ ] -1: I disagree because . . . >

[jira] [Created] (ARROW-5552) Go: make Schema and Field implement Stringer

2019-06-11 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-5552: -- Summary: Go: make Schema and Field implement Stringer Key: ARROW-5552 URL: https://issues.apache.org/jira/browse/ARROW-5552 Project: Apache Arrow Issue T

[jira] [Created] (ARROW-5551) [Go] invalid FixedSizeArray representation

2019-06-11 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-5551: -- Summary: [Go] invalid FixedSizeArray representation Key: ARROW-5551 URL: https://issues.apache.org/jira/browse/ARROW-5551 Project: Apache Arrow Issue Typ