[jira] [Created] (ARROW-7812) [CI] Upgrade LLVM in manylinux1 docker image

2020-02-09 Thread Prudhvi Porandla (Jira)
Prudhvi Porandla created ARROW-7812: --- Summary: [CI] Upgrade LLVM in manylinux1 docker image Key: ARROW-7812 URL: https://issues.apache.org/jira/browse/ARROW-7812 Project: Apache Arrow Issue

Arrow Datasets Functionality for Python

2020-02-09 Thread Matthew Turner
Hi Wes / Arrow Dev Team, Following up on our brief twitter convo on the Datasets functionality in R / Python. To provide context to others, you had mentioned that the API in python / pyarrow was more developer centric and intended for u

[jira] [Created] (ARROW-7811) pyarrow 0.15.1 wheels on PyPI no longer supports pyarrow.orc?

2020-02-09 Thread Zhenyi Zhou (Jira)
Zhenyi Zhou created ARROW-7811: -- Summary: pyarrow 0.15.1 wheels on PyPI no longer supports pyarrow.orc? Key: ARROW-7811 URL: https://issues.apache.org/jira/browse/ARROW-7811 Project: Apache Arrow

[jira] [Created] (ARROW-7810) Fixed typo and made code running in vignette

2020-02-09 Thread Zhuo Jia Dai (Jira)
Zhuo Jia Dai created ARROW-7810: --- Summary: Fixed typo and made code running in vignette Key: ARROW-7810 URL: https://issues.apache.org/jira/browse/ARROW-7810 Project: Apache Arrow Issue Type: I

[jira] [Created] (ARROW-7809) R vignette does not run on Win 10 nor ubuntu

2020-02-09 Thread Zhuo Jia Dai (Jira)
Zhuo Jia Dai created ARROW-7809: --- Summary: R vignette does not run on Win 10 nor ubuntu Key: ARROW-7809 URL: https://issues.apache.org/jira/browse/ARROW-7809 Project: Apache Arrow Issue Type: B

[jira] [Created] (ARROW-7808) [Java][Dataset] Implement Datasets Java API

2020-02-09 Thread Hongze Zhang (Jira)
Hongze Zhang created ARROW-7808: --- Summary: [Java][Dataset] Implement Datasets Java API Key: ARROW-7808 URL: https://issues.apache.org/jira/browse/ARROW-7808 Project: Apache Arrow Issue Type: I

Re: [VOTE] Release Apache Arrow 0.16.0 - RC2

2020-02-09 Thread Sutou Kouhei
Hi, MSYS2 package is updated: https://github.com/msys2/MINGW-packages/pull/6175 Thanks, -- kou In "Re: [VOTE] Release Apache Arrow 0.16.0 - RC2" on Sun, 9 Feb 2020 09:06:39 -0800, Neal Richardson wrote: > R package 0.16.0 has been accepted by CRAN; may take a few more days for > CRAN to

[jira] [Created] (ARROW-7807) [R] Installation on RHEL 7 Cannot call io___MemoryMappedFile__Open()

2020-02-09 Thread Omar Yassin (Jira)
Omar Yassin created ARROW-7807: -- Summary: [R] Installation on RHEL 7 Cannot call io___MemoryMappedFile__Open() Key: ARROW-7807 URL: https://issues.apache.org/jira/browse/ARROW-7807 Project: Apache Arrow

Re: [Format] Dictionary edge cases (encoding nulls and nested dictionaries)

2020-02-09 Thread Brian Hulette
> It seems we should potentially disallow dictionaries to contain null values? +1 - I've always thought it was odd you could encode null values in two different places for dictionary encoded columns. You could argue it's more efficient to encode the nulls in the dictionary, but I think if we're goi

Re: [pyarrow] How can one handle parquet encoding memory bombs

2020-02-09 Thread Rollo Konig-Brock
Hey Wes, I've opened up a MR for this ARROW-7800. Tests aren't really done as it was committed from a WIP to demonstrate what was happening with categorical types and what was needed in order to sidestep this. I might need a bit of advice what else to test here. Rollo On Fri, Feb 7, 2020 at 7:

Re: [VOTE] Release Apache Arrow 0.16.0 - RC2

2020-02-09 Thread Neal Richardson
R package 0.16.0 has been accepted by CRAN; may take a few more days for CRAN to build Windows and macOS binaries. Neal On Fri, Feb 7, 2020 at 4:50 PM Neal Richardson wrote: > Homebrew PR is up: https://github.com/Homebrew/homebrew-core/pull/49908 > > > > On Fri, Feb 7, 2020 at 3:44 PM Neal Ric

[jira] [Created] (ARROW-7806) [Python] {Array,Table,RecordBatch}.to_pandas() do not support Large variants of ListArray, BinaryArray and StringArray

2020-02-09 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7806: Summary: [Python] {Array,Table,RecordBatch}.to_pandas() do not support Large variants of ListArray, BinaryArray and StringArray Key: ARROW-7806 URL: https://issues.apache.org/jira/browse/

[NIGHTLY] Arrow Build Report for Job nightly-2020-02-09-0

2020-02-09 Thread Crossbow
Arrow Build Report for Job nightly-2020-02-09-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-02-09-0 Failed Tasks: - test-conda-python-3.7-turbodbc-latest: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-02-09-0-circle-test-cond