Re: Problems running merge_arrow_pr

2018-08-19 Thread Wes McKinney
hi Andy, I think the system Python on Ubuntu 16.04 should work fine (we support both 2.7 and 3.5/6). What do the following commands print for you? $ python Python 2.7.15 | packaged by conda-forge | (default, May 8 2018, 14:46:53) [GCC 4.8.2 20140120 (Red Hat 4.8.2-15)] on linux2 Type "help", "co

[jira] [Created] (ARROW-3090) Accompany error messages with assertions

2018-08-19 Thread Chao Sun (JIRA)
Chao Sun created ARROW-3090: --- Summary: Accompany error messages with assertions Key: ARROW-3090 URL: https://issues.apache.org/jira/browse/ARROW-3090 Project: Apache Arrow Issue Type: New Feature

Problems running merge_arrow_pr

2018-08-19 Thread Andy Grove
I'm trying to merge my first PR and the script is failing with the following output. What version of Python should I be running? Is there some prerequisite that I'm missing? I did install jira-python. I'm running on Ubuntu 16.04 in case that is relevant. andy@freedom:~/git/arrow$ dev/merge_arrow_

Re: [DISCUSS] Rust add adapter for parquet

2018-08-19 Thread Renjie Liu
Hi, Chao: I've opened a Jira issue for that and am planning to work on it. On Mon, Aug 20, 2018 at 11:03 AM Renjie Liu wrote: > Yes, it's a mistake, sorry for that > > > On Mon, Aug 20, 2018 at 10:57 AM Chao Sun wrote: > >> (s/flink/arrow - it is a mistake?) >> >> Thanks Renjie for your interes

[jira] [Created] (ARROW-3089) Add ArrayBuilder for different Arrow arrays

2018-08-19 Thread Chao Sun (JIRA)
Chao Sun created ARROW-3089: --- Summary: Add ArrayBuilder for different Arrow arrays Key: ARROW-3089 URL: https://issues.apache.org/jira/browse/ARROW-3089 Project: Apache Arrow Issue Type: New Featur

Re: [DISCUSS] Rust add adapter for parquet

2018-08-19 Thread Renjie Liu
Yes, it's a mistake, sorry for that. On Mon, Aug 20, 2018 at 10:57 AM Chao Sun wrote: > (s/flink/arrow - it is a mistake?) > > Thanks Renjie for your interest. Yes, one of the next steps in parquet-rs > is to integrate with Apache Arrow. Actually we just had a discussion >

Re: [DISCUSS] Rust add adapter for parquet

2018-08-19 Thread Chao Sun
(s/flink/arrow - it is a mistake?) Thanks Renjie for your interest. Yes, one of the next steps in parquet-rs is to integrate with Apache Arrow. Actually we just had a discussion about this recently. Feel free to share your comments on GitHub.

[jira] [Created] (ARROW-3088) [Rust] Use internal `Result` type instead of `Result

2018-08-19 Thread Paddy Horan (JIRA)
Paddy Horan created ARROW-3088: -- Summary: [Rust] Use internal `Result` type instead of `Result Key: ARROW-3088 URL: https://issues.apache.org/jira/browse/ARROW-3088 Project: Apache Arrow Issue

Re: [DISCUSS] Solutions for improving the Arrow-Parquet C++ development morass

2018-08-19 Thread Wes McKinney
hi Uwe, I agree with your points. Currently we have 3 software artifacts: 1. Arrow C++ libraries 2. Parquet C++ libraries with Arrow columnar integration 3. C++ interop layer for Python + Cython bindings Changes in #1 prompt an awkward workflow involving multiple PRs; as a result of this we just

Re: [DISCUSS] Moving forward on the Arrow-Parquet C++ monorepo project

2018-08-19 Thread Julian Hyde
The votes to grant commit access that you refer to are votes to appoint committers or PMC members. Those votes are conducted in private to prevent embarrassment in case the vote fails, or if the vote passes and the individual declines the offer. I don’t see any such potential embarrassment here

Re: Error building parquet-cpp

2018-08-19 Thread Renato Marroquín Mogrovejo
I tried a clean installation again, using master in both projects, and that went well. Thanks again! 2018-08-19 16:56 GMT+02:00 Renato Marroquín Mogrovejo < renatoj.marroq...@gmail.com>: > Thanks Uwe! > I just tried, using master on both (arrow and parquet-cpp) and now I got > this: > > /mnt/SG/ma

Re: Error building parquet-cpp

2018-08-19 Thread Renato Marroquín Mogrovejo
Thanks Uwe! I just tried, using master on both (arrow and parquet-cpp) and now I got this: /mnt/SG/marenato/project_jitq/parquet-cpp/src/parquet/statistics.cc:136:12: error: ‘__builtin_isnan’ is not a member of ‘std’ if (!std::isnan(values[i])) { ^ I am using Debian btw. 2018-

[jira] [Created] (ARROW-3087) [C++] Add kernels for comparison operations to scalars

2018-08-19 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-3087: -- Summary: [C++] Add kernels for comparison operations to scalars Key: ARROW-3087 URL: https://issues.apache.org/jira/browse/ARROW-3087 Project: Apache Arrow Issue

Re: Error building parquet-cpp

2018-08-19 Thread Uwe L. Korn
Hello Renato, you will need Arrow 0.9 for the latest parquet release. If you want to use Arrow 0.10, you need to use the parquet-cpp master. We plan to make a parquet-cpp 1.5.0 release soon that will support Arrow 0.10, then you can combine the latest release versions of both again. Uwe On Sun, A

Error building parquet-cpp

2018-08-19 Thread Renato Marroquín Mogrovejo
Hi all, I am trying to build parquet-cpp (tag/apache-parquet-cpp-1.4.0-rc1) with arrow/cpp (tag/apache-arrow-0.10.0). I am using: g++-5 (Debian 5.5.0-12) 5.5.0 20171010. Arrow/cpp builds and compiles successfully. But when building parquet-cpp, I am getting many errors like the following. /mnt/SG/m

Re: [DISCUSS] Moving forward on the Arrow-Parquet C++ monorepo project

2018-08-19 Thread Uwe L. Korn
Hello Wes, I think we should have a vote on this. I expect that, as previously mentioned in this debate, there is a large overlap between the two projects as it's on the C++ side basically the same community but as it is an issue concerning both projects, the vote shall also be done in the two

Re: [DISCUSS] Solutions for improving the Arrow-Parquet C++ development morass

2018-08-19 Thread Uwe L. Korn
Back from vacation, I also want to finally raise my voice. With the current state of the Parquet<->Arrow development, I see a benefit in merging the code base for now, but not necessarily forever. Parquet C++ is the main code base of an artefact for which an Arrow C++ adapter is built and that

[jira] [Created] (ARROW-3086) [Glib] GISCAN fails due to conda-shipped openblas

2018-08-19 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-3086: -- Summary: [Glib] GISCAN fails due to conda-shipped openblas Key: ARROW-3086 URL: https://issues.apache.org/jira/browse/ARROW-3086 Project: Apache Arrow Issue Type

Re: Java compatibility issues with parquet-mr project

2018-08-19 Thread Uwe L. Korn
Hello, I've just merged https://github.com/apache/parquet-mr/pull/516. This updates the arrow dependency to 0.10.0 in parquet-arrow and should fix the build errors you were seeing with FixedSizeBinary. Uwe On Wed, Aug 8, 2018, at 12:39 PM, Tal Yaniv wrote: > Hi Masayuki, > > I changed the arro

Re: [DISCUSS] Rust add adapter for parquet

2018-08-19 Thread Uwe L. Korn
Hello, you might also want to raise this with the https://github.com/sunchao/parquet-rs project. The overlap between the developers of this project and the Arrow Rust implementation is quite large but still it may make sense to also start a discussion there. Uwe On Thu, Aug 16, 2018, at 9:14

Re: Building a fast Arrow-native delimited file reader (e.g. for CSVs)

2018-08-19 Thread Uwe L. Korn
Hello Antoine and Wes, really excited to see this happen. CSVs and co are the file formats you never get rid of, so it is really important to have an Arrow reader. Concerning the custom implementation I can further back this as during the parquet_arrow reader, I have spent quite some amount of

[jira] [Created] (ARROW-3085) [Rust] Add an adapter for parquet.

2018-08-19 Thread Renjie Liu (JIRA)
Renjie Liu created ARROW-3085: - Summary: [Rust] Add an adapter for parquet. Key: ARROW-3085 URL: https://issues.apache.org/jira/browse/ARROW-3085 Project: Apache Arrow Issue Type: New Feature