[ 
https://issues.apache.org/jira/browse/HIVE-25670?focusedWorklogId=680320&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-680320
 ]

ASF GitHub Bot logged work on HIVE-25670:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 11/Nov/21 15:34
            Start Date: 11/Nov/21 15:34
    Worklog Time Spent: 10m 
      Work Description: scarlin-cloudera commented on a change in pull request #2763:
URL: https://github.com/apache/hive/pull/2763#discussion_r747601749



##########
File path: ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/RelOptHiveTable.java
##########
@@ -333,12 +360,20 @@ public boolean isKey(ImmutableBitSet columns) {
           parentTableQualifiedName.add(parentTableName);
           qualifiedName = parentTableName;
         }
+        if (!tablesCache.containsKey(qualifiedName)) {
+          // Table doesn't exist in the cache, so we don't need to track
+          // these referential constraints. But we do need to keep track
+          // of the table in case the tableCache gets populated later, though
+          // in theory, this should never happen based on how this is called.

Review comment:
       Ok, in my most recent push, I created a hash map wrapper. This wrapper 
allows the caller to mark the table map as complete once all tables have been 
parsed.
   
   Fetching the referential constraints is now only allowed once the table map 
is marked as parsed, so we can be assured that the list of tables used in the 
query is complete at that point. I think I did this with minimal intrusion on 
the original code, though some of that code still needs rewriting.
   
   Thanks for the review comments and making me think about this a bit more!
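
   For readers following along, here is a minimal sketch of the wrapper idea 
described above. It is not the code from the pull request: the class name 
`ParsedTablesMap` and the methods `markParsed`, `isParsed`, and 
`containsKeyWhenParsed` are hypothetical names chosen for illustration, and 
the real change may be structured differently.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical illustration only; names are assumptions, not Hive APIs.
 * Wraps a map of qualified table name -> table object and records whether
 * every table referenced by the query has been added yet.
 */
class ParsedTablesMap<K, V> {
  private final Map<K, V> tables = new HashMap<>();
  private boolean parsed = false;

  /** Registers a table found while parsing the query. */
  public V put(K qualifiedName, V table) {
    return tables.put(qualifiedName, table);
  }

  /** Plain lookup; allowed at any time. */
  public V get(K qualifiedName) {
    return tables.get(qualifiedName);
  }

  /** Called by the owner of the map once all tables have been parsed. */
  public void markParsed() {
    parsed = true;
  }

  public boolean isParsed() {
    return parsed;
  }

  /**
   * Membership check intended for the referential-constraint path.
   * It refuses to answer before the map is complete, because a "not
   * present" answer would be unreliable at that point.
   */
  public boolean containsKeyWhenParsed(K qualifiedName) {
    if (!parsed) {
      throw new IllegalStateException(
          "Table map is not marked as parsed; the table list may be incomplete");
    }
    return tables.containsKey(qualifiedName);
  }
}
```

   With a guard like this, a caller building referential constraints can skip 
any foreign key table for which `containsKeyWhenParsed` returns false, without 
worrying that the table might be added to the map later.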





Issue Time Tracking
-------------------

    Worklog Id:     (was: 680320)
    Time Spent: 1h 10m  (was: 1h)

> Avoid getTable() calls for foreign key tables not used in a query
> -----------------------------------------------------------------
>
>                 Key: HIVE-25670
>                 URL: https://issues.apache.org/jira/browse/HIVE-25670
>             Project: Hive
>          Issue Type: Improvement
>          Components: HiveServer2
>            Reporter: Steve Carlin
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> In RelOptHiveTable, we generate the referential constraints for the table. In 
> this process, we make a metastore call to fetch each foreign key table. The 
> constraints are used later on for potential gains on joins done on the key.
> However, there is no need to fetch the constraints for a foreign key table 
> that is not used in the query. If we know up front which tables the query 
> uses, we can save a bit on compilation time.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
