[ https://issues.apache.org/jira/browse/SPARK-11077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated SPARK-11077: ----------------------------------- Labels: pull-request-available (was: ) > Join elimination in Catalyst > ---------------------------- > > Key: SPARK-11077 > URL: https://issues.apache.org/jira/browse/SPARK-11077 > Project: Spark > Issue Type: New Feature > Components: SQL > Reporter: Ankur Dave > Assignee: Ankur Dave > Priority: Major > Labels: pull-request-available > > Join elimination is a query optimization where certain joins can be > eliminated when followed by projections that only keep columns from one side > of the join, and when certain columns are known to be unique or foreign keys. > This can be very useful for queries involving views and machine-generated > queries. > Adding join elimination to Catalyst requires (1) support for unique and > foreign key hints in logical plans, (2) methods in the DataFrame API to let > users provide these hints, and (3) an optimizer rule that eliminates unique > key outer joins and referential integrity joins when followed by an > appropriate projection. > This proposal is described in detail here: > https://docs.google.com/document/d/1-YgQSQywHfAo4PhAT-zOOkFZtVcju99h3dYQq-i9GWQ/edit?usp=sharing -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org