[ 
https://issues.apache.org/jira/browse/IMPALA-14417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-14417:
---------------------------------------
    Description: 
Currently, the JDBC external data source integration issues an additional 
{{COUNT(*)}} query in the {{prepare()}} phase to estimate the number of rows in 
a JDBC table. While this helps the planner choose a better parallelization 
strategy for complex queries (e.g. joins or aggregations), it also introduces 
an extra JDBC round trip. 

To give users more control, we propose introducing a query option that enables 
or disables row count estimation for JDBC tables. When enabled, Impala will 
issue the {{COUNT(*)}} query during prepare. When disabled, Impala will skip 
this step, avoiding the additional JDBC round trip.
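
A minimal sketch of the proposed behavior, for illustration only. The class name, method name, the -1 "unknown" sentinel, and the use of a supplier to stand in for the {{COUNT(*)}} query are all assumptions made here. They are not taken from the Impala code base, and the actual option name is not specified in this issue.

```java
import java.util.function.LongSupplier;

/** Hypothetical illustration; none of these names come from Impala. */
final class JdbcPrepareSketch {
    /** Assumed sentinel meaning "row count unknown". */
    static final long UNKNOWN_ROW_COUNT = -1L;

    /**
     * Returns the row count estimate that prepare() would report.
     *
     * @param estimationEnabled value of the proposed query option
     * @param countQuery stand-in for running SELECT COUNT(*) over JDBC
     */
    static long estimateRows(boolean estimationEnabled, LongSupplier countQuery) {
        if (!estimationEnabled) {
            // Option disabled: skip the extra JDBC round trip entirely.
            return UNKNOWN_ROW_COUNT;
        }
        // Option enabled: pay one round trip to obtain the count.
        return countQuery.getAsLong();
    }
}
```

With the option disabled the supplier is never invoked, which models avoiding the round trip at the cost of the planner having no row count to work with.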



> Query option for JDBC row count estimation
> ------------------------------------------
>
>                 Key: IMPALA-14417
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14417
>             Project: IMPALA
>          Issue Type: New Feature
>            Reporter: Pranav Yogi Lodha
>            Priority: Major


