I have created a PR to start a discussion around representing logical query
plans in Gandiva (ARROW-4163).

https://github.com/apache/arrow/pull/3319

I think that adding the various steps such as projection, selection, sort,
and so on is fairly simple and not contentious. The harder part is how we
represent data sources, since this likely means different things for
different use cases. My thought is that we can register data sources by
name (similar to CREATE EXTERNAL TABLE in Hadoop) or tie this into the IPC
metadata somehow so we can pass memory addresses and schema information.
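
To make the "register by name" option concrete, here is a rough sketch in
Python. It is only an illustration and not what the PR implements: every
name in it (TableScan, Selection, Projection, Sort, SourceRegistry,
resolve_source) is hypothetical, and predicates and types are plain strings
to keep it short. The plan refers to its data source only by name, and a
separate registry maps that name to schema information.

```python
from dataclasses import dataclass
from typing import Dict, List, Union


# Hypothetical plan nodes; none of these names come from Gandiva or the PR.
@dataclass(frozen=True)
class TableScan:
    source_name: str  # the plan only knows the source by its registered name


@dataclass(frozen=True)
class Selection:
    input: "Plan"
    predicate: str  # kept as a string here; a real plan would use an expression tree


@dataclass(frozen=True)
class Projection:
    input: "Plan"
    columns: List[str]


@dataclass(frozen=True)
class Sort:
    input: "Plan"
    keys: List[str]


Plan = Union[TableScan, Selection, Projection, Sort]


class SourceRegistry:
    """Maps a data source name to its schema (column name -> type name)."""

    def __init__(self) -> None:
        self._schemas: Dict[str, Dict[str, str]] = {}

    def register(self, name: str, schema: Dict[str, str]) -> None:
        self._schemas[name] = dict(schema)

    def schema_of(self, name: str) -> Dict[str, str]:
        return self._schemas[name]


def resolve_source(plan: Plan) -> str:
    """Walk down the single-input chain until the TableScan leaf is reached."""
    while not isinstance(plan, TableScan):
        plan = plan.input
    return plan.source_name


registry = SourceRegistry()
registry.register("employees", {"name": "utf8", "salary": "float64"})

plan = Sort(
    input=Projection(
        input=Selection(input=TableScan("employees"), predicate="salary > 1000"),
        columns=["name", "salary"],
    ),
    keys=["salary"],
)

source_schema = registry.schema_of(resolve_source(plan))
```

The point of the sketch is that the plan itself stays independent of where
the data lives. The IPC-based alternative would replace the name lookup with
whatever carries the memory addresses and schema.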

I would love to hear others' thoughts on this.

Thanks,

Andy.
