On 4/13/22 7:58 PM, Rok Mihevc wrote:
Thanks for describing the use case, Li!
The examples we ran are on UTC timestamps without any timezone
complications; perhaps there is room for short circuits when there are no
timezone complications...
I think a UTC-zoned timestamp array might currently behave as a
regular timezoned timestamp array and use the zoned path.
However, setting timezone="" should use a non-zoned computation path.
See here:
https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/kernels/temporal_internal.h#L233
Rok
For many of the kernels, comparing achieved throughput with memory
bandwidth, for example as measured using Likwid[1], would be a good test
of the performance of the implementation. However, reaching memory
bandwidth would typically require SIMD, and many initial implementations
do not use SIMD operations. Where SIMD is used at the moment, it is
mostly done through the XSIMD library[2]. Maybe this is something to add
to the developer documentation? There has been a related discussion on
the list about xsimd adoption in the Arrow codebase.
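
As a rough illustration of the comparison suggested above, here is a
minimal C++ sketch that times a simple element-wise loop and reports its
throughput as a fraction of what std::memcpy achieves on the same
buffers. Everything in it is an assumption made for illustration:
AddOffset is a stand-in loop and not an Arrow kernel, the helper names
are hypothetical, and memcpy is only a crude proxy for the bandwidth
figure that Likwid-Bench[1] would measure more rigorously.

```cpp
#include <algorithm>
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <limits>
#include <numeric>
#include <vector>

// Hypothetical helper: run `fn` `repeats` times (assumes repeats >= 1) and
// return the best observed throughput in GB/s for `bytes_per_call` bytes.
template <typename Fn>
double BestGigabytesPerSecond(std::size_t bytes_per_call, int repeats, Fn&& fn) {
  double best_seconds = std::numeric_limits<double>::infinity();
  for (int i = 0; i < repeats; ++i) {
    auto start = std::chrono::steady_clock::now();
    fn();
    auto stop = std::chrono::steady_clock::now();
    double seconds = std::chrono::duration<double>(stop - start).count();
    // Guard against a zero reading from a coarse timer.
    seconds = std::max(seconds, 1e-9);
    best_seconds = std::min(best_seconds, seconds);
  }
  return static_cast<double>(bytes_per_call) / best_seconds / 1e9;
}

// Stand-in for a simple element-wise kernel (NOT an Arrow kernel).
// Assumes out->size() >= in.size().
void AddOffset(const std::vector<int64_t>& in, int64_t offset,
               std::vector<int64_t>* out) {
  for (std::size_t i = 0; i < in.size(); ++i) {
    (*out)[i] = in[i] + offset;
  }
}

struct BandwidthReport {
  double kernel_gbps;  // throughput of the stand-in kernel
  double memcpy_gbps;  // throughput of memcpy on the same amount of data
  double fraction;     // kernel_gbps / memcpy_gbps
  int64_t checksum;    // reads the results so the work is not optimized away
};

// Compare the stand-in kernel against memcpy. Assumes n > 0.
BandwidthReport CompareWithMemcpy(std::size_t n, int repeats) {
  std::vector<int64_t> in(n), out(n, 0), copy(n, 0);
  std::iota(in.begin(), in.end(), int64_t{0});

  // Both operations read n values and write n values.
  const std::size_t bytes = 2 * n * sizeof(int64_t);

  double memcpy_gbps = BestGigabytesPerSecond(bytes, repeats, [&] {
    std::memcpy(copy.data(), in.data(), n * sizeof(int64_t));
  });
  double kernel_gbps = BestGigabytesPerSecond(bytes, repeats, [&] {
    AddOffset(in, 42, &out);
  });

  int64_t checksum = copy[n - 1] + out[n - 1];
  return {kernel_gbps, memcpy_gbps, kernel_gbps / memcpy_gbps, checksum};
}
```

Calling CompareWithMemcpy with an array much larger than the last-level
cache and a handful of repeats gives a fraction that indicates how far
the loop is from the memcpy baseline. The absolute numbers depend heavily
on compiler flags (for example, whether the loop is auto-vectorized), on
the machine, and on array size, so they are indicative only.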
[1] https://github.com/RRZE-HPC/likwid/wiki/Likwid-Bench
[2] https://github.com/xtensor-stack/xsimd