+1 (binding)
On Wed, Dec 25, 2024, at 07:21, L. C. Hsieh wrote: > +1 (binding) > > It looks like a useful feature for query systems, especially for the > ones that integrate native, JVM (e.g., Comet) and require statistics > passed across runtimes. > > Thank you. > > On Tue, Dec 24, 2024 at 5:52 AM Andrew Lamb <al...@influxdata.com> wrote: >> >> +1 (binding) >> >> I think the proposal is well reasoned, incorporates feedback so far, and >> will be generally useful (even within the Rust Arrow/DataFusion ecosystem) >> >> Thank you Kou for driving this initiative. >> >> Andrew >> >> On Mon, Dec 23, 2024 at 4:28 PM Edmondo Porcu <edmondo.po...@gmail.com> >> wrote: >> >> > +1 (non-binding) >> > >> > On Mon, Dec 23, 2024 at 10:15 AM Felipe Oliveira Carvalho < >> > felipe...@gmail.com> wrote: >> > >> > > +1 >> > > >> > > On Mon, Dec 23, 2024 at 2:37 AM Sutou Kouhei <k...@clear-code.com> wrote: >> > > >> > > > Hi, >> > > > >> > > > I would like to propose standardizing how to represent >> > > > statistics as Apache Arrow array. >> > > > >> > > > Motivation: >> > > > >> > > > * We want to pass not only Apache Arrow data but also >> > > > statistics of them through the C data interface for query >> > > > planning. >> > > > >> > > > Approach: >> > > > >> > > > * Define a standardized schema for statistics. >> > > > * Represent statistics as an Apache Arrow array that uses >> > > > the schema. >> > > > * Pass the statistics Apache Arrow array through the C data >> > > > interface like a normal Apache Arrow array. >> > > > >> > > > Note that we don't define a new interface for statistics. We >> > > > just use the existing C data interface. A statistics Apache >> > > > Arrow array is passed through a separated API call. >> > > > >> > > > Note that this proposal doesn't define anything about how or >> > > > where to use it. The above example just shows one use-case. >> > > > >> > > > This is based on the previous rejected vote discussion: >> > > > https://lists.apache.org/thread/rsw3wsyj68dksc98s5rpdp6dn8hfk0yd >> > > > >> > > > See also: >> > > > >> > > > * The discussion of this: >> > > > https://lists.apache.org/thread/b6chzlyn95rztoybs39b6olz907g12gj >> > > > * The PR of this proposal: >> > > > https://github.com/apache/arrow/pull/45058 >> > > > * The preview URL of the PR: >> > > > >> > > > >> > > >> > http://crossbow.voltrondata.com/pr_docs/45058/format/StatisticsSchema.html >> > > > >> > > > >> > > > The vote will be open for at least 72 hours. >> > > > >> > > > [ ] +1 Accept this proposal >> > > > [ ] +0 >> > > > [ ] -1 Do not accept this proposal because... >> > > > >> > > > >> > > > Thanks, >> > > > -- >> > > > kou >> > > > >> > > >> >