imay opened a new issue #1743: compute skew when concurrent queries
URL: https://github.com/apache/incubator-doris/issues/1743
 
 
   When we ran a high-concurrency test against Doris, we found that the CPU of 
one backend was completely consumed, while the other backends had 60% CPU idle. 
   
   
![image](https://user-images.githubusercontent.com/1249159/64230948-cefb7580-cf20-11e9-8a0d-977e1a7f106a.png)
   
   The test query looks like "select day, sum(pv) from tbl where day = 20190904 
group by day". After looking into the backends' logs, I found that the 
second-phase aggregation of every query was processed on the exhausted backend. 
   
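   This is because Doris assigns the same backend to a given aggregation/join 
shuffle slot for every query. This causes compute skew when the group-by 
cardinality of the query is smaller than the number of backends: only a few 
slots receive data, and those slots always land on the same backends.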
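
   The effect can be illustrated with a toy model. This is only a sketch and 
not Doris's actual scheduler: the function names, the modulo used in place of 
hash partitioning, and the per-query rotation are all assumptions made for 
illustration. It counts how many second-phase aggregations each backend 
receives when every query has a single group-by key.

```python
from collections import Counter


def fixed_assignment(num_backends, query_id):
    # Hypothetical model of the reported behaviour:
    # slot i is served by backend i for every query.
    return list(range(num_backends))


def rotated_assignment(num_backends, query_id):
    # Hypothetical alternative: shift the slot-to-backend mapping per query.
    return [(slot + query_id) % num_backends for slot in range(num_backends)]


def simulate(assign, num_backends, num_queries, group_keys):
    # Count second-phase aggregation tasks that land on each backend.
    load = Counter()
    for query_id in range(num_queries):
        slot_to_backend = assign(num_backends, query_id)
        for key in group_keys:
            slot = key % num_backends  # stand-in for hash partitioning
            load[slot_to_backend[slot]] += 1
    return load


# One group-by key (day = 20190904), 4 backends, 100 concurrent queries.
fixed = simulate(fixed_assignment, 4, 100, [20190904])
rotated = simulate(rotated_assignment, 4, 100, [20190904])

print(dict(sorted(fixed.items())))    # {0: 100}
print(dict(sorted(rotated.items())))  # {0: 25, 1: 25, 2: 25, 3: 25}
```

   With the fixed mapping, all 100 aggregations land on one backend, which 
matches the skew seen in the test. Varying the mapping per query spreads the 
same work evenly in this model.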
