Rajesh Balamohan created HIVE-17035:
---------------------------------------

             Summary: Optimizer: Lineage transform() should be invoked after 
rest of the optimizers are invoked
                 Key: HIVE-17035
                 URL: https://issues.apache.org/jira/browse/HIVE-17035
             Project: Hive
          Issue Type: Bug
          Components: Logical Optimizer
            Reporter: Rajesh Balamohan
            Priority: Minor


In a fairly large query which had tens of left join, time taken to create 
linageInfo itself took 1500+ seconds. This is due to the fact that the table 
had lots of columns and in some processing, it ended up having 7000+ value 
columns in {{ReduceSinkLineage}}. 

It would be good to invoke lineage transform when rest of the optimizers in 
{{Optimizer}} are invoked. This would avoid help in improving the runtime.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to