[ 
https://issues.apache.org/jira/browse/IMPALA-14417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-14417:
---------------------------------------
    Description: 
Currently, the JDBC external data source integration issues an additional count 
estimation query in the prepare phase to estimate the number of rows in a JDBC 
table. While this helps the planner choose a better parallelization strategy 
for complex queries (e.g. joins or aggregations), it also introduces an extra 
JDBC round trip. 

To give users more control, we propose introducing a query option that enables 
or disables row count estimation for JDBC tables. When enabled, Impala will 
issue the count estimation query during prepare. When disabled, Impala will 
skip this step, avoiding the additional JDBC round trip.

  was:
Currently, the JDBC external data source integration issues an additional 
COUNT(*) query in the {{prepare()}} phase to estimate the number of rows in a 
JDBC table. While this helps the planner choose a better parallelization 
strategy for complex queries (e.g. joins or aggregations), it also introduces 
an extra JDBC round trip. 

To give users more control, we propose introducing a query option that enables 
or disables row count estimation for JDBC tables. When enabled, Impala will 
issue the COUNT(*) query during prepare. When disabled, Impala will skip this 
step, avoiding the additional JDBC round trip.


> Query option for JDBC row count estimation
> ------------------------------------------
>
>                 Key: IMPALA-14417
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14417
>             Project: IMPALA
>          Issue Type: New Feature
>            Reporter: Pranav Yogi Lodha
>            Priority: Major
>
> Currently, the JDBC external data source integration issues an additional 
> count estimation query in the prepare phase to estimate the number of rows in 
> a JDBC table. While this helps the planner choose a better parallelization 
> strategy for complex queries (e.g. joins or aggregations), it also introduces 
> an extra JDBC round trip. 
> To give users more control, we propose introducing a query option that 
> enables or disables row count estimation for JDBC tables. When enabled, 
> Impala will issue the count estimation query during prepare. When disabled, 
> Impala will skip this step, avoiding the additional JDBC round trip.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to