PR is up: https://github.com/apache/arrow/pull/13364

Look forward to getting this in since there's a bunch of follow on
work that I'd like to get started on ASAP!

On Thu, Jun 9, 2022 at 7:34 AM Wes McKinney <wesmck...@gmail.com> wrote:
>
> I'm making good progress getting my branch PR-ready -- working through
> the compute-scalar-test suite and fixing the little things I broke. I
> hope I'll have it done by the end of the week.
>
> On Mon, Jun 6, 2022 at 3:21 PM Wes McKinney <wesmck...@gmail.com> wrote:
> >
> > I created https://issues.apache.org/jira/browse/ARROW-16755 as an
> > umbrella issue to track improvements to reduce overhead in the
> > expression and kernel execution machinery. Please help by attaching
> > related issues and creating new issues for specific individual efforts
> > here. I'll work as quickly as I can to have my initial patch
> > ARROW-16756 ready which will unblock the next few projects here
> >
> > On Mon, Jun 6, 2022 at 10:35 AM Wes McKinney <wesmck...@gmail.com> wrote:
> > >
> > >  This is definitely only the first stage of cleanup and streamlining —
> > > I anticipate multiple rounds of refactoring (maybe not as invasive and
> > > painful as this one), and this patch I'm not sure will do a lot to
> > > alleviate bottom line expression evaluation overhead but it creates
> > > the environment (i.e. where a whole chain of scalar functions that all
> > > write into preallocated memory can execute without having to touch
> > > shared_ptrs or deal with other objects with excess microperformance
> > > overhead) where such optimization can happen more easily.
> > >
> > >
> > > On Mon, Jun 6, 2022 at 4:08 AM Antoine Pitrou <anto...@python.org> wrote:
> > > >
> > > >
> > > > Le 06/06/2022 à 09:34, Sasha Krassovsky a écrit :
> > > > > Wow that's a lot of progress!
> > > > > Definitely agree on the scalar outputs point.
> > > > >
> > > > > One point about the ArraySpan - why does it need to know its data 
> > > > > type?
> > > > > Once a kernel has been resolved by the registry, the kernel will only 
> > > > > know
> > > > > how to execute on the specific type it was resolved for, right?
> > > >
> > > > Because of parametric types for example (e.g. timestamps with a unit and
> > > > timezone, or decimals with a precision and scale).
> > > >
> > > > Regards
> > > >
> > > > Antoine.

Reply via email to