[jira] [Created] (ARROW-8193) [C++] arrow-future-test fails to compile on gcc 4.8

2020-03-23 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8193: --- Summary: [C++] arrow-future-test fails to compile on gcc 4.8 Key: ARROW-8193 URL: https://issues.apache.org/jira/browse/ARROW-8193 Project: Apache Arrow Issue

[jira] [Created] (ARROW-8192) [C++] script for unpack avx512 intrinsics code

2020-03-23 Thread Frank Du (Jira)
Frank Du created ARROW-8192: --- Summary: [C++] script for unpack avx512 intrinsics code Key: ARROW-8192 URL: https://issues.apache.org/jira/browse/ARROW-8192 Project: Apache Arrow Issue Type: Improve

[NIGHTLY] Arrow Build Report for Job nightly-2020-03-23-1

2020-03-23 Thread Crossbow
Arrow Build Report for Job nightly-2020-03-23-1 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-03-23-1 Failed Tasks: - debian-stretch: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-03-23-1-github-debian-stretch - gandiva-jar-tru

[jira] [Created] (ARROW-8191) [Packaging][APT] Fix cmake removal in Debian GNU/Linux Stretch

2020-03-23 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-8191: --- Summary: [Packaging][APT] Fix cmake removal in Debian GNU/Linux Stretch Key: ARROW-8191 URL: https://issues.apache.org/jira/browse/ARROW-8191 Project: Apache Arrow

[jira] [Created] (ARROW-8190) [C++][Flight] Allow setting IpcWriteOptions and IpcReadOptions in Flight IPC message reader and writer classes

2020-03-23 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8190: --- Summary: [C++][Flight] Allow setting IpcWriteOptions and IpcReadOptions in Flight IPC message reader and writer classes Key: ARROW-8190 URL: https://issues.apache.org/jira/browse/AR

Re: Write a parquet file with delta encoding enable

2020-03-23 Thread Wes McKinney
These encodings are not available for use in the Parquet C++ library yet -- partially implemented but not thoroughly tested or exposed in the public API -- so it's not possible to generate them from Python. I don't know about Java; you may want to ask on the Parquet mailing list. On Mon, Mar 23, 20

Re: Java API for parsing parque

2020-03-23 Thread Wes McKinney
There's an effort to expose the C++ Parquet library to Java via JNI that seems promising: https://issues.apache.org/jira/browse/ARROW-6720 On Mon, Mar 23, 2020 at 11:15 AM Hasara Maithree wrote: > > Hi all, > > Is there a Java API for parsing Parquet format to Arrow format? > > Thank You

Re: [DISCUSS] Adding "trivial" buffer compression option to IPC protocol (ARROW-300)

2020-03-23 Thread Antoine Pitrou
On 24/03/2020 at 00:39, Wes McKinney wrote: > > As far as what Micah said about having a limited number of > compressors: I would be in favor of having just LZ4 and ZSTD. +1, exactly my thought as well. Regards Antoine.

Re: [DISCUSS] Adding "trivial" buffer compression option to IPC protocol (ARROW-300)

2020-03-23 Thread Wes McKinney
hi folks, Sorry it's taken me a little while to produce supporting benchmarks. * I implemented experimental trivial body buffer compression in https://github.com/apache/arrow/pull/6638 * I hooked up the Arrow IPC file format with compression as the new Feather V2 format in https://github.com/apac

[jira] [Created] (ARROW-8189) [Python] Python bindings for C++ Builder classes

2020-03-23 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8189: --- Summary: [Python] Python bindings for C++ Builder classes Key: ARROW-8189 URL: https://issues.apache.org/jira/browse/ARROW-8189 Project: Apache Arrow Issue Typ

Re: [Discuss] Proposal for optimizing Datasets over S3/object storage

2020-03-23 Thread David Li
Thanks. I've set up an AWS account for my own testing for now. I've also submitted a PR to add a basic benchmark which can be run self-contained, against a local Minio instance, or against S3: https://github.com/apache/arrow/pull/6675 I ran the benchmark from my local machine, and I can test from

Java API for parsing parque

2020-03-23 Thread Hasara Maithree
Hi all, Is there a Java API for parsing Parquet format to Arrow format? Thank You

[jira] [Created] (ARROW-8188) [R] Adapt to latest checks in R-devel

2020-03-23 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8188: -- Summary: [R] Adapt to latest checks in R-devel Key: ARROW-8188 URL: https://issues.apache.org/jira/browse/ARROW-8188 Project: Apache Arrow Issue Type: Bu

[jira] [Created] (ARROW-8187) [R] Make test assertions robust to i18n

2020-03-23 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8187: -- Summary: [R] Make test assertions robust to i18n Key: ARROW-8187 URL: https://issues.apache.org/jira/browse/ARROW-8187 Project: Apache Arrow Issue Type:

[NIGHTLY] Arrow Build Report for Job nightly-2020-03-23-0

2020-03-23 Thread Crossbow
Arrow Build Report for Job nightly-2020-03-23-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-03-23-0 Failed Tasks: - debian-stretch: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-03-23-0-github-debian-stretch - gandiva-jar-tru

[jira] [Created] (ARROW-8186) [Python] Dataset expression != returns bool instead of expression for invalid value

2020-03-23 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8186: Summary: [Python] Dataset expression != returns bool instead of expression for invalid value Key: ARROW-8186 URL: https://issues.apache.org/jira/browse/ARROW-8186

Re: [Discuss][FlightRPC] Extensions to Flight: "DoBidirectional"

2020-03-23 Thread David Li
Hey Wes, Thanks for the review. I've broken out the format change into this PR: https://github.com/apache/arrow/pull/6686 Best, David On 3/22/20, Wes McKinney wrote: > hi David, > > I did a preliminary view and things look to be on the right track > there. What do you think about breaking out t

[jira] [Created] (ARROW-8185) [Packaging] Document the available nightly wheels, conda and R packages under the development section

2020-03-23 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8185: -- Summary: [Packaging] Document the available nightly wheels, conda and R packages under the development section Key: ARROW-8185 URL: https://issues.apache.org/jira/browse/ARROW

[jira] [Created] (ARROW-8184) [Packaging] Use arrow-nightlies (or similar) organization name on Anaconda and Gemfury to host the nightlies

2020-03-23 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8184: -- Summary: [Packaging] Use arrow-nightlies (or similar) organization name on Anaconda and Gemfury to host the nightlies Key: ARROW-8184 URL: https://issues.apache.org/jira/brows

Write a parquet file with delta encoding enable

2020-03-23 Thread Omega Gamage
I was trying to write a parquet file with delta encoding. This page states that parquet supports three types of delta encoding (DELTA_BINARY_PACKED, DELTA_LENGTH_BYTE_ARRAY, DELTA_BYTE_ARRAY). Since spark, pyspark or pyarr