HappenLee opened a new issue #7966:
URL: https://github.com/apache/incubator-doris/issues/7966


   ### Search before asking
   
   - [X] I had searched in the 
[issues](https://github.com/apache/incubator-doris/issues?q=is%3Aissue) and 
found no similar issues.
   
   
   ### Description
   
   ## Motivation
   The grouping set query implemented by Doris is inferior to Kylin, Hive and 
other competitive products in terms of performance. The main reasons are as 
follows
   
   1. There are many ineffective data duplication in the process of 
implementation, and many functions that should not be calculated are 
calculated, for example, **grouping functions should not be calculated as group 
by columns.*
   
   2. **Aggregation push down is not implemented**, which will lead to a large 
amount of data duplication.
   
   ### What shoule we do
   
   1. Aggregate and push down, such as **grouping set ((a), (b), (a, b))*,  
should aggregate (a,b) first, and then duplicate the data of grouping set, 
which can reduce the generation of a large amount of duplicate data and greatly 
improve the query efficiency
   
   2. Reduce unnecessary funcion compute in grouping set query.
   
   
   ### Solution
   
   #TODO
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of 
Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@doris.apache.org
For additional commands, e-mail: commits-h...@doris.apache.org

Reply via email to