[ 
https://issues.apache.org/jira/browse/IMPALA-10204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yida Wu updated IMPALA-10204:
-----------------------------
    Attachment: query_tpcds.sql

> Evaluate AdmitQuery params for efficiency
> -----------------------------------------
>
>                 Key: IMPALA-10204
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10204
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Distributed Exec
>            Reporter: Thomas Tauber-Marshall
>            Priority: Critical
>         Attachments: query_tpcds.sql
>
>
> In the first version of the AdmissionControlService, we're sending the entire 
> TQueryExecRequest/TQueryOptions as a sidecar to the admission controller. 
> There are various things contained in the TQueryExecRequest/TQueryOptions 
> that are not actually needed by the admission controller, and sending them 
> increases network load and query running time unnecessarily.
> We should evaluate how much of a perf impact there is due to this and how 
> much could actually be removed.
> Some small things may be non-trivial to remove and ultimately not worth it, 
> for example the tree of TPlanNodes contains some info needed by the admission 
> controller (eg. memory estimates) and some things that are not (eg. runtime 
> filter descriptors). Making two parallel trees, one with only 
> admission-required data (which would require extensive refactoring in the 
> planner or wasted work in the coordinator copying out the required parts from 
> what the planner returns) may be too complicated/introduce too much other 
> overhead to be worth it.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to