+1 (binding)

On Tue, Dec 24, 2024, 6:48 PM David Li <lidav...@apache.org> wrote:

> +1 (binding)
>
> On Wed, Dec 25, 2024, at 07:21, L. C. Hsieh wrote:
> > +1 (binding)
> >
> > It looks like a useful feature for query systems, especially for the
> > ones that integrate native, JVM (e.g., Comet) and require statistics
> > passed across runtimes.
> >
> > Thank you.
> >
> > On Tue, Dec 24, 2024 at 5:52 AM Andrew Lamb <al...@influxdata.com>
> wrote:
> >>
> >> +1 (binding)
> >>
> >> I think the proposal is well reasoned, incorporates feedback so far, and
> >> will be generally useful (even within the Rust Arrow/DataFusion
> ecosystem)
> >>
> >> Thank you Kou for driving this initiative.
> >>
> >> Andrew
> >>
> >> On Mon, Dec 23, 2024 at 4:28 PM Edmondo Porcu <edmondo.po...@gmail.com>
> >> wrote:
> >>
> >> > +1 (non-binding)
> >> >
> >> > On Mon, Dec 23, 2024 at 10:15 AM Felipe Oliveira Carvalho <
> >> > felipe...@gmail.com> wrote:
> >> >
> >> > > +1
> >> > >
> >> > > On Mon, Dec 23, 2024 at 2:37 AM Sutou Kouhei <k...@clear-code.com>
> wrote:
> >> > >
> >> > > > Hi,
> >> > > >
> >> > > > I would like to propose standardizing how to represent
> >> > > > statistics as Apache Arrow array.
> >> > > >
> >> > > > Motivation:
> >> > > >
> >> > > > * We want to pass not only Apache Arrow data but also
> >> > > >   statistics of them through the C data interface for query
> >> > > >   planning.
> >> > > >
> >> > > > Approach:
> >> > > >
> >> > > > * Define a standardized schema for statistics.
> >> > > > * Represent statistics as an Apache Arrow array that uses
> >> > > >   the schema.
> >> > > > * Pass the statistics Apache Arrow array through the C data
> >> > > >   interface like a normal Apache Arrow array.
> >> > > >
> >> > > > Note that we don't define a new interface for statistics. We
> >> > > > just use the existing C data interface. A statistics Apache
> >> > > > Arrow array is passed through a separated API call.
> >> > > >
> >> > > > Note that this proposal doesn't define anything about how or
> >> > > > where to use it. The above example just shows one use-case.
> >> > > >
> >> > > > This is based on the previous rejected vote discussion:
> >> > > > https://lists.apache.org/thread/rsw3wsyj68dksc98s5rpdp6dn8hfk0yd
> >> > > >
> >> > > > See also:
> >> > > >
> >> > > > * The discussion of this:
> >> > > >
> https://lists.apache.org/thread/b6chzlyn95rztoybs39b6olz907g12gj
> >> > > > * The PR of this proposal:
> >> > > >   https://github.com/apache/arrow/pull/45058
> >> > > > * The preview URL of the PR:
> >> > > >
> >> > > >
> >> > >
> >> >
> http://crossbow.voltrondata.com/pr_docs/45058/format/StatisticsSchema.html
> >> > > >
> >> > > >
> >> > > > The vote will be open for at least 72 hours.
> >> > > >
> >> > > > [ ] +1 Accept this proposal
> >> > > > [ ] +0
> >> > > > [ ] -1 Do not accept this proposal because...
> >> > > >
> >> > > >
> >> > > > Thanks,
> >> > > > --
> >> > > > kou
> >> > > >
> >> > >
> >> >
>

Reply via email to