Thanks for starting that, Andy!

> I also think it could be helpful with in-memory language interoperability,
> such as passing query plans between Python and Rust.

Yes! I prototyped a datafusion-python and pyarrow datasets integration[1] a
few weeks ago that could really benefit from this. I'll have to look into
it more,
but will likely also need a method on PyArrow compute expressions to convert
to a Substrait expression.

[1] https://github.com/datafusion-contrib/datafusion-python/pull/21

On Mon, Mar 7, 2022 at 8:40 AM Wang Xudong <wxd963996...@gmail.com> wrote:

> Thank you!
> This is a great idea, I'll try to contribute some code when I have time!
>
> ---
> xudong
>
> Gavin Ray <ray.gavi...@gmail.com> 于2022年3月8日周二 00:36写道:
>
> > Incredibly exciting! Following along eagerly =)
> >
> > On Mon, Mar 7, 2022 at 11:31 AM Andy Grove <andygrov...@gmail.com>
> wrote:
> >
> > > I created a new repo in the datafusion-contrib GitHub org over the
> > weekend
> > > with a starting point for supporting DataFusion as both a producer and
> > > consumer of Substrait plans.
> > >
> > > https://github.com/datafusion-contrib/datafusion-substrait
> > >
> > > I am hopeful that we can eventually use Substrait in Ballista as a
> > > replacement for the current query plan protobuf format, meaning that
> the
> > > Ballista scheduler could potentially be used with engines other than
> > > DataFusion.
> > >
> > > I also think it could be helpful with in-memory language
> > interoperability,
> > > such as passing query plans between Python and Rust.
> > >
> > > I plan on continuing to merge my own PRs here as I flesh out more of
> > this,
> > > at least until there are other contributors.
> > >
> > > Thanks,
> > >
> > > Andy.
> > >
> >
>

Reply via email to