[jira] [Created] (ARROW-9098) RecordBatch::ToStructArray cannot handle record batches with 0 column

2020-06-10 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-9098: Summary: RecordBatch::ToStructArray cannot handle record batches with 0 column Key: ARROW-9098 URL: https://issues.apache.org/jira/browse/ARROW-9098 Project: Apache Arrow

[jira] [Created] (ARROW-9071) [C++] MakeArrayOfNull makes invalid ListArray

2020-06-08 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-9071: Summary: [C++] MakeArrayOfNull makes invalid ListArray Key: ARROW-9071 URL: https://issues.apache.org/jira/browse/ARROW-9071 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-9037) [C++/C-ABI] unable to import array with null count == -1 (which could be exported)

2020-06-04 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-9037: Summary: [C++/C-ABI] unable to import array with null count == -1 (which could be exported) Key: ARROW-9037 URL: https://issues.apache.org/jira/browse/ARROW-9037 Project

[jira] [Created] (ARROW-8277) [Python] RecordBatch interface improvements

2020-03-30 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-8277: Summary: [Python] RecordBatch interface improvements Key: ARROW-8277 URL: https://issues.apache.org/jira/browse/ARROW-8277 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-7806) [Python] {Array,Table,RecordBatch}.to_pandas() do not support Large variants of ListArray, BinaryArray and StringArray

2020-02-09 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7806: Summary: [Python] {Array,Table,RecordBatch}.to_pandas() do not support Large variants of ListArray, BinaryArray and StringArray Key: ARROW-7806 URL: https://issues.apache.org/jira/browse

[jira] [Created] (ARROW-7802) [C++] Support for LargeBinary and LargeString in the hash kernel

2020-02-07 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7802: Summary: [C++] Support for LargeBinary and LargeString in the hash kernel Key: ARROW-7802 URL: https://issues.apache.org/jira/browse/ARROW-7802 Project: Apache Arrow

[jira] [Created] (ARROW-7510) [C++] Array::null_count() is not thread-compatible

2020-01-07 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7510: Summary: [C++] Array::null_count() is not thread-compatible Key: ARROW-7510 URL: https://issues.apache.org/jira/browse/ARROW-7510 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-7362) [Python] ListArray.flatten() should take care of slicing offsets

2019-12-09 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7362: Summary: [Python] ListArray.flatten() should take care of slicing offsets Key: ARROW-7362 URL: https://issues.apache.org/jira/browse/ARROW-7362 Project: Apache Arrow

[jira] [Created] (ARROW-7229) [C++] Unify ConcatenateTables APIs

2019-11-21 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7229: Summary: [C++] Unify ConcatenateTables APIs Key: ARROW-7229 URL: https://issues.apache.org/jira/browse/ARROW-7229 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-7228) [Python] Expose RecordBatch.FromStructArray in Python.

2019-11-21 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7228: Summary: [Python] Expose RecordBatch.FromStructArray in Python. Key: ARROW-7228 URL: https://issues.apache.org/jira/browse/ARROW-7228 Project: Apache Arrow Issue

[jira] [Created] (ARROW-7227) [Python] Provide wrappers for ConcatenateWithPromotion()

2019-11-21 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-7227: Summary: [Python] Provide wrappers for ConcatenateWithPromotion() Key: ARROW-7227 URL: https://issues.apache.org/jira/browse/ARROW-7227 Project: Apache Arrow Issue

ConcatenateTables APIs

2019-11-08 Thread Zhuo Peng
Hi, https://github.com/apache/arrow/pull/5534 introduced ConcatenateTablesWithPromotion(). And there is already a ConcatenateTables() function which behaves differently (it requires the tables to have the schema). Wes raised a concern in that PR [1] that we might end up having many Concatenate

[jira] [Created] (ARROW-6878) [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3

2019-10-14 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-6878: Summary: [Python] pa.array() does not handle list of dicts with bytes keys correctly under python3 Key: ARROW-6878 URL: https://issues.apache.org/jira/browse/ARROW-6878

[jira] [Created] (ARROW-6848) [C++] Specify -std=c++11 instead of -std=gnu++11 when building

2019-10-10 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-6848: Summary: [C++] Specify -std=c++11 instead of -std=gnu++11 when building Key: ARROW-6848 URL: https://issues.apache.org/jira/browse/ARROW-6848 Project: Apache Arrow

Re: Should Arrow adopt C++14 / 17?

2019-10-04 Thread Zhuo Peng
On 2019/10/04 19:43:04, Wes McKinney wrote: > On Fri, Oct 4, 2019 at 12:45 PM Zhuo Peng wrote: > > > > > > > > On 2019/10/04 17:05:00, Antoine Pitrou wrote: > > > > > > Le 04/10/2019 à 19:01, Zhuo Peng a écrit : > > > > > > &g

Re: Should Arrow adopt C++14 / 17?

2019-10-04 Thread Zhuo Peng
On 2019/10/04 17:05:00, Antoine Pitrou wrote: > > Le 04/10/2019 à 19:01, Zhuo Peng a écrit : > > > > backports are cool for internal use, but probably not so if a public API > > accepts it? (because you vendor the headers in (i.e. namespace, symbol > > name

Re: Should Arrow adopt C++14 / 17?

2019-10-04 Thread Zhuo Peng
On 2019/10/04 16:53:59, Antoine Pitrou wrote: > > Le 04/10/2019 à 18:05, Zhuo Peng a écrit : > > Dear Arrow maintainers, > > > > Sorry if this was raised before. I did search the mailing list but "C++" > > matched too many results.. > > &

Should Arrow adopt C++14 / 17?

2019-10-04 Thread Zhuo Peng
Dear Arrow maintainers, Sorry if this was raised before. I did search the mailing list but "C++" matched too many results.. With manylinux1 (GCC4.8) being sunset, both Conda and Pypa are providing a modern enough toolchain (Conda Forge - GCC7; Pypa manylinux2010 docker - devtoolset-8(GCC8)). A

[jira] [Created] (ARROW-6775) Proposal for several Array utility functions

2019-10-02 Thread Zhuo Peng (Jira)
Zhuo Peng created ARROW-6775: Summary: Proposal for several Array utility functions Key: ARROW-6775 URL: https://issues.apache.org/jira/browse/ARROW-6775 Project: Apache Arrow Issue Type: Wish

Re: [DISCUSS] C-level in-process array protocol

2019-09-19 Thread Zhuo Peng
On Thu, Sep 19, 2019 at 10:56 Antoine Pitrou wrote: > > Le 19/09/2019 à 19:52, Zhuo Peng a écrit : > > > > The problems are only potential and theoretical, and won't bite anyone > > until it occurs though, and it's more likely to happen with pip/wheel > tha

Re: [DISCUSS] C-level in-process array protocol

2019-09-19 Thread Zhuo Peng
n. [1] https://github.com/tensorflow/tensorflow/issues/23561 Regards > > Antoine. > > > Le 19/09/2019 à 18:21, Zhuo Peng a écrit : > > Hi Antoine, > > > > I'm also interested in a stable ABI (previously I posted on this mailing > > list about the AB

Re: [DISCUSS] C-level in-process array protocol

2019-09-19 Thread Zhuo Peng
Hi Antoine, I'm also interested in a stable ABI (previously I posted on this mailing list about the ABI issues I had [1]). Does having such an ABI-stable C-struct imply that there will be a set of C-APIs exposed by the Arrow (C++) library (which I think would lead to a solution to all the inherit

Re: [VOTE] Release Apache Arrow 0.14.1 - RC0

2019-07-17 Thread Zhuo Peng
Hi Krisztián, Sorry if it's too late, but is it possible to also include https://github.com/apache/arrow/pull/4883 in the release? This would help resolve https://github.com/apache/arrow/issues/4472 . Thanks, Zhuo On Wed, Jul 17, 2019 at 3:00 AM Antoine Pitrou wrote: > > +1 (binding). > > Tes

[jira] [Created] (ARROW-5894) libgandiva.so.14 is exporting libstdc++ symbols

2019-07-09 Thread Zhuo Peng (JIRA)
Zhuo Peng created ARROW-5894: Summary: libgandiva.so.14 is exporting libstdc++ symbols Key: ARROW-5894 URL: https://issues.apache.org/jira/browse/ARROW-5894 Project: Apache Arrow Issue Type: Bug

Re: How should a Python/C++ project depend on Arrow (issues with ABI and wheel)?

2019-06-28 Thread Zhuo Peng
sallow STL classes in arrow's public headers. This might not be feasible, because std::shared_ptr and std::vector are used everywhere. Or maybe we only allow some "safe" STL classes in the public headers. But there is no guarantee for them to be safe. It's purely empirical.

[jira] [Created] (ARROW-5749) [Python] Add Python binding for Table::CombineChunks()

2019-06-26 Thread Zhuo Peng (JIRA)
Zhuo Peng created ARROW-5749: Summary: [Python] Add Python binding for Table::CombineChunks() Key: ARROW-5749 URL: https://issues.apache.org/jira/browse/ARROW-5749 Project: Apache Arrow Issue

How should a Python/C++ project depend on Arrow (issues with ABI and wheel)?

2019-06-20 Thread Zhuo Peng
Dear Arrow maintainers, I work on several TFX (TensorFlow eXtended) [1] projects (e.g. TensorFlow Data Validation [2]) and am trying to use Arrow in them. These projects are mostly written in Python but has C++ code as Python extension modules, therefore we use both Arrow’s C++ and Python APIs. Ou

[jira] [Created] (ARROW-5635) Support "compacting" a table

2019-06-17 Thread Zhuo Peng (JIRA)
Zhuo Peng created ARROW-5635: Summary: Support "compacting" a table Key: ARROW-5635 URL: https://issues.apache.org/jira/browse/ARROW-5635 Project: Apache Arrow Issue Type: I

[jira] [Created] (ARROW-5554) Add a python wrapper for arrow::Concatenate

2019-06-11 Thread Zhuo Peng (JIRA)
Zhuo Peng created ARROW-5554: Summary: Add a python wrapper for arrow::Concatenate Key: ARROW-5554 URL: https://issues.apache.org/jira/browse/ARROW-5554 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-5528) Concatenate() crashes when concatenating empty binary arrays.

2019-06-07 Thread Zhuo Peng (JIRA)
Zhuo Peng created ARROW-5528: Summary: Concatenate() crashes when concatenating empty binary arrays. Key: ARROW-5528 URL: https://issues.apache.org/jira/browse/ARROW-5528 Project: Apache Arrow