I've commented on the PR. I'm +1 on the principle and on the proposed format / layout additions.
Regards Antoine. Le 14/12/2022 à 17:27, Matt Topol a écrit :
Hello, I'd like to propose adding the RLE type based on earlier discussions[1][2] to the Arrow format: - Columnar Format description: https://github.com/apache/arrow/pull/13333/files#diff-8b68cf6859e881f2357f5df64bb073135d7ff6eeb51f116418660b3856564c60 - Flatbuffers changes: https://github.com/apache/arrow/pull/14176/files#diff-e54b4f5d2d279acc5d1df5df9a7636f0142a8041fe02f07034e0d8be48444b07 There is a proposed implementation available in both C++ (written by Tobias Zagorni) and Go[3][4]. Both implementations have mostly the same tests implemented and were tested to be compatible over IPC with an archery test. In both cases, the implementations are split out among several Draft PRs so that they can be easily reviewed piecemeal if the vote is approved, with each Draft PR including the changes of the one before it. The links provided are the Draft PRs with the entirety of the changes included. The vote will be open for at least 72 hours. [ ] +1 add the proposed RLE type to the Apache Arrow format [ ] -1 do not add the proposed RLE type to the Apache Arrow format because... Thanks much, and please let me know if any more information or links are needed (I've never proposed a vote before on here!) --Matt [1] https://lists.apache.org/thread/bfz3m5nyf7flq7n6q9b1bx3jhcn4wq29 [2] https://lists.apache.org/thread/xb7c723csrtwt0md3m4p56bt0193n7jq [3] https://github.com/apache/arrow/pull/14179 [4] https://github.com/apache/arrow/pull/14223