Hi,
Multi-column sort_indices on record batch has been implemented:
https://github.com/apache/arrow/pull/8612
You'll be able to use it with Apache Arrow 3.0.0.
Thanks,
--
kou
In
"Sort int tuples across Arrow arrays in C++" on Thu, 3 Sep 2020 14:26:09
+0200,
Rares
There are various open source columnar database engines you could look
at to get inspiration for a varargs variant of sort_indices.
On Thu, Sep 3, 2020 at 9:26 AM Ben Kietzman wrote:
>
> Hi Rares,
>
> The arrow API does not currently support sorting against multiple columns.
> We'd welcome a JIRA
Hi Rares,
The arrow API does not currently support sorting against multiple columns.
We'd welcome a JIRA/PR to add that support.
One potential workaround is storing the tuple as a single column of
fixed_size_list(int32, 2), which could then be viewed [1] as int64 (for
which sorting
is supported).
Hello,
I have a set of integer tuples that need to be collected and sorted at a
coordinator. Here is an example with tuples of length 2:
[(1, 10),
(1, 15),
(2, 10),
(2, 15)]
I am considering storing each column in an Arrow array, e.g., [1, 1, 2, 2]
and [10, 15, 10, 15], and have the Arrow arr