Re: [DISCUSS][Format] Time Interval Changes

2019-04-01 Thread Micah Kornfield
On Mon, Apr 1, 2019 at 4:17 PM Jacques Nadeau wrote: > >> >> I don't think we should include byte-width unless we have a concrete >> use-case (it can be added later, using 8 Bytes as the default if not set). >> > I'm okay with only allowing one today. I wonder whether we should declare > it now a

[jira] [Created] (ARROW-5088) [C++] Disallow using BUILD_WARNING_LEVEL=CHECKIN in release mode

2019-04-01 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5088: --- Summary: [C++] Disallow using BUILD_WARNING_LEVEL=CHECKIN in release mode Key: ARROW-5088 URL: https://issues.apache.org/jira/browse/ARROW-5088 Project: Apache Arrow

Re: [Discuss][Format] Arrow Flight URI scheme proposal

2019-04-01 Thread Jacques Nadeau
My thinking is ideally the protocol would be more opaque than engineer-y in that an upgrade would happen as part of the negotiation process. For example, when a connection is made, client says "hey, I also support these things" and then server responds and says "hey, let's send data on this channel

Re: [DISCUSS][Format] Time Interval Changes

2019-04-01 Thread Jacques Nadeau
> > > > I don't think we should include byte-width unless we have a concrete > use-case (it can be added later, using 8 Bytes as the default if not set). > I'm okay with only allowing one today. I wonder whether we should declare it now and only allow 8? > > Comment below on equivalences, is that

[jira] [Created] (ARROW-5087) [Debian] APT repository no longer contains libarrow-dev

2019-04-01 Thread Steven Fackler (JIRA)
Steven Fackler created ARROW-5087: - Summary: [Debian] APT repository no longer contains libarrow-dev Key: ARROW-5087 URL: https://issues.apache.org/jira/browse/ARROW-5087 Project: Apache Arrow

Re: Support for numpy matrix

2019-04-01 Thread Mitar
Hi! I agree. This is in fact all information which is already there. :-) Mitar On Sat, Mar 30, 2019 at 8:40 PM Wes McKinney wrote: > > hi Mitar, > > Let's discuss further on JIRA? It's best to keep all the information > about the issue in one place. > > Thanks > > On Sat, Mar 30, 2019 at 7:42

Re: [DISCUSS][Format] Time Interval Changes

2019-04-01 Thread Micah Kornfield
Sorry sent this too early. TL;DR; I'm in favor of moving forward with this declaration: table Interval { unit: IntervalUnit; timeUnit: TimeUnit; // defined when using duration } I don't think we should include byte-width unless we have a concrete use-case (it can be added later, using 8 Bytes

Re: [DISCUSS][Format] Time Interval Changes

2019-04-01 Thread Micah Kornfield
TL;DR; I'm in favor of moving forward with this declaration: On Mon, Apr 1, 2019 at 11:38 AM Jacques Nadeau wrote: > I'm sorry, I've been busy with several other things. > > A question, what about this alternative? > enum IntervalUnit: short { YEAR_MONTH, DAY_TIME, DURATION } > table Interva

Re: [R] Improving documentation and transparency for Arrow build and packaging work for R

2019-04-01 Thread Javier Luraschi
Added entry for "Updating CRAN packages" here: https://cwiki.apache.org/confluence/display/ARROW/Release+Management+Guide#ReleaseManagementGuide-UpdatingCRANpackages I'm sure we will have to update with more details and

Re: [DISCUSS][Format] Time Interval Changes

2019-04-01 Thread Jacques Nadeau
I'm sorry, I've been busy with several other things. A question, what about this alternative? enum IntervalUnit: short { YEAR_MONTH, DAY_TIME, DURATION } table Interval { unit: IntervalUnit; timeUnit: TimeUnit; // defined when using duration byteWidth: short; // defined when using duration

Re: [RESULT][VOTE] Release Apache Arrow 0.13.0 - RC4

2019-04-01 Thread Wes McKinney
I have written a blog post, additions from other maintainers would be welcome https://github.com/apache/arrow/pull/4091 I'll plan to publish tomorrow morning. Would someone like to help with updating the documentation sites? I believe that generating the documentation site currently requires a C

Re: [DISCUSS][Format] Time Interval Changes

2019-04-01 Thread Wes McKinney
I would like to propose a vote on this feature this week. Could someone from the Java side weigh in since there is some existing code relating to intervals there already? On Wed, Mar 27, 2019 at 10:49 PM Micah Kornfield wrote: > > Hi Wes, > Thanks for the feedback. I'm happy to update the PR to

Re: [Discuss][Format] Arrow Flight URI scheme proposal

2019-04-01 Thread Wes McKinney
I would like to propose a vote on this feature this week. I'll wait another day or so before starting the vote On Fri, Mar 29, 2019 at 4:14 PM David Li wrote: > > Hey Wes, > > Thanks for the comments. I've updated the doc a bit to reflect some > options for supporting "hybrid" transports. > > Bes

[jira] [Created] (ARROW-5086) Space leak in ParquetFile.read_row_group()

2019-04-01 Thread JIRA
Jakub Okoński created ARROW-5086: Summary: Space leak in ParquetFile.read_row_group() Key: ARROW-5086 URL: https://issues.apache.org/jira/browse/ARROW-5086 Project: Apache Arrow Issue Type:

Re: [Discuss] Benchmarking infrastructure

2019-04-01 Thread Wes McKinney
hi David -- yes, we definitely should set up cross-host and cross-implementation performance testing (that we can measure and record in the benchmark database) for Flight. As one starting point https://issues.apache.org/jira/browse/ARROW-4566 - Wes On Mon, Apr 1, 2019 at 10:30 AM David Li wrote

[jira] [Created] (ARROW-5085) [Python/C++] Conversion of dict encoded null column fails in parquet writing when using RowGroups

2019-04-01 Thread Florian Jetter (JIRA)
Florian Jetter created ARROW-5085: - Summary: [Python/C++] Conversion of dict encoded null column fails in parquet writing when using RowGroups Key: ARROW-5085 URL: https://issues.apache.org/jira/browse/ARROW-5085

[jira] [Created] (ARROW-5084) [Website] Blog post / release announcement for 0.13.0

2019-04-01 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5084: --- Summary: [Website] Blog post / release announcement for 0.13.0 Key: ARROW-5084 URL: https://issues.apache.org/jira/browse/ARROW-5084 Project: Apache Arrow Issu

Re: [Discuss] Benchmarking infrastructure

2019-04-01 Thread David Li
One more thought, is there interest in running cross-host Flight benchmarks, and perhaps validating them against iperf or a similar tool? It would be great to get latency/throughput numbers and make sure upgrades to gRPC don't tank performance on accident, and it would help argue for why people sho

[jira] [Created] (ARROW-5083) [Developer] In merge_arrow_pr.py script, allow user to set a released Fix Version

2019-04-01 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5083: --- Summary: [Developer] In merge_arrow_pr.py script, allow user to set a released Fix Version Key: ARROW-5083 URL: https://issues.apache.org/jira/browse/ARROW-5083 Project

[jira] [Created] (ARROW-5082) [Python][Packaging] Reduce size of macOS and manylinux1 wheels

2019-04-01 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-5082: --- Summary: [Python][Packaging] Reduce size of macOS and manylinux1 wheels Key: ARROW-5082 URL: https://issues.apache.org/jira/browse/ARROW-5082 Project: Apache Arrow

Re: Unable to merge PRs on new computer

2019-04-01 Thread Andy Grove
Ohh... thanks, Antoine! I'm pretty sure that's the issue. On Mon, Apr 1, 2019 at 7:18 AM Antoine Pitrou wrote: > > Le 01/04/2019 à 15:12, Andy Grove a écrit : > > Hi Krisztian, > > > > It's basically a github permissions issue when it comes to pushing the > > merged PR branch to apache master. >

Re: Unable to merge PRs on new computer

2019-04-01 Thread Antoine Pitrou
Le 01/04/2019 à 15:12, Andy Grove a écrit : > Hi Krisztian, > > It's basically a github permissions issue when it comes to pushing the > merged PR branch to apache master. > > I get prompted for git username on the command line, which doesn't happen > on my old desktop. Ensure you're pushing t

Re: Unable to merge PRs on new computer

2019-04-01 Thread Andy Grove
Hi Krisztian, It's basically a github permissions issue when it comes to pushing the merged PR branch to apache master. I get prompted for git username on the command line, which doesn't happen on my old desktop. I did create a github API token and set ARROW_GITHUB_API_TOKEN but it doesn't help.

[jira] [Created] (ARROW-5081) [C++] Consistently use PATH_SUFFIXES in CMake config

2019-04-01 Thread David Li (JIRA)
David Li created ARROW-5081: --- Summary: [C++] Consistently use PATH_SUFFIXES in CMake config Key: ARROW-5081 URL: https://issues.apache.org/jira/browse/ARROW-5081 Project: Apache Arrow Issue Type: I

Re: FPGA support for Apache Arrow

2019-04-01 Thread Dimitris Lekkas
Alright, I think I understand your suggestion. It's like defining a SharedMemoryPool for example where we implement our desired way of memory allocation (with page-alignment + mem mapping). Thanks! On Mon, Apr 1, 2019 at 1:34 PM Antoine Pitrou wrote: > On Mon, 1 Apr 2019 10:25:26 +0300 > Dimitr

Re: FPGA support for Apache Arrow

2019-04-01 Thread Antoine Pitrou
On Mon, 1 Apr 2019 10:25:26 +0300 Dimitris Lekkas wrote: > Hey Antoine, > > Regarding the C++ side, do you consider variable alignment for buffers > a valuable addition? Would you be interested for such a PR? I think that it you have specific alignment requirements (and perhaps other requirement

Re: Unable to merge PRs on new computer

2019-04-01 Thread Krisztián Szűcs
Hey Andy, What's the error message? Ensure, that You've correctly set up the environment variables described in the docstring: # This utility assumes you already have a local Arrow git clone and that you # have added remotes corresponding to both (i) the Github Apache Arrow mirror # and (ii) the

Re: CI failure by Sphinx

2019-04-01 Thread Krisztián Szűcs
Sphinx has a new release which is not followed by breathe yet. I'm pinning sphinx. On Mon, Apr 1, 2019 at 4:55 AM Kouhei Sutou wrote: > Hi, > > Our CI is failed since 2 days ago: > https://travis-ci.org/apache/arrow/jobs/513191828#L5131-L5141 > > > /home/travis/build/apache/arrow/pyarrow-

Re: FPGA support for Apache Arrow

2019-04-01 Thread Dimitris Lekkas
Hey Antoine, Regarding the C++ side, do you consider variable alignment for buffers a valuable addition? Would you be interested for such a PR? Finally, I want to ask if I should be discussing such issues on Jira instead of dev mailing list. Thanks for the replies, despite you being busy with the

Re: [RESULT][VOTE] Release Apache Arrow 0.13.0 - RC4

2019-04-01 Thread Kouhei Sutou
Thanks Krisztián! I've updated MSYS2 package: https://github.com/msys2/MINGW-packages/pull/5129 Remaining task: * Release announce (Wes will do tomorrow) Thanks, -- kou In "Re: [RESULT][VOTE] Release Apache Arrow 0.13.0 - RC4" on Mon, 1 Apr 2019 08:50:29 +0200, Krisztián Szűcs wrot