Re: Sort int tuples across Arrow arrays in C++

2020-11-24 Thread Sutou Kouhei
Hi, Multi-column sort_indices on record batch has been implemented: https://github.com/apache/arrow/pull/8612 You'll be able to use it with Apache Arrow 3.0.0. Thanks, -- kou In "Sort int tuples across Arrow arrays in C++" on Thu, 3 Sep 2020 14:26:09 +0200, Rares

Re: Sort int tuples across Arrow arrays in C++

2020-09-03 Thread Wes McKinney
There are various open source columnar database engines you could look at to get inspiration for a varargs variant of sort_indices. On Thu, Sep 3, 2020 at 9:26 AM Ben Kietzman wrote: > > Hi Rares, > > The arrow API does not currently support sorting against multiple columns. > We'd welcome a JIRA

Re: Sort int tuples across Arrow arrays in C++

2020-09-03 Thread Ben Kietzman
Hi Rares, The arrow API does not currently support sorting against multiple columns. We'd welcome a JIRA/PR to add that support. One potential workaround is storing the tuple as a single column of fixed_size_list(int32, 2), which could then be viewed [1] as int64 (for which sorting is supported).

Sort int tuples across Arrow arrays in C++

2020-09-03 Thread Rares Vernica
Hello, I have a set of integer tuples that need to be collected and sorted at a coordinator. Here is an example with tuples of length 2: [(1, 10), (1, 15), (2, 10), (2, 15)] I am considering storing each column in an Arrow array, e.g., [1, 1, 2, 2] and [10, 15, 10, 15], and have the Arrow arr