[ https://issues.apache.org/jira/browse/ARROW-4589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rok Mihevc updated ARROW-4589: ------------------------------ External issue URL: https://github.com/apache/arrow/issues/21132 > [Rust] [DataFusion] Implement projection push down query optimizer rule > ----------------------------------------------------------------------- > > Key: ARROW-4589 > URL: https://issues.apache.org/jira/browse/ARROW-4589 > Project: Apache Arrow > Issue Type: Improvement > Components: Rust - DataFusion > Affects Versions: 0.12.0 > Reporter: Andy Grove > Assignee: Andy Grove > Priority: Major > Labels: pull-request-available > Fix For: 0.13.0 > > Time Spent: 2h 20m > Remaining Estimate: 0h > > If I run a query like the following: > {code:java} > SELECT MIN(fare_amount), MAX(fare_amount) FROM tripdata{code} > I see this logical plan: > {code:java} > Logical plan: Aggregate: groupBy=[[]], aggr=[[MIN(#10), MAX(#10)]] > TableScan: tripdata projection=None{code} > > This means that every column is being loaded into arrays rather than just the > two columns that I care about, resulting in terrible performance. -- This message was sent by Atlassian Jira (v8.20.10#820010)