imay opened a new issue #1743: compute skew when concurrent queries URL: https://github.com/apache/incubator-doris/issues/1743 When we do high concurrency test to Doris, we found that one of all backends are complete consumed, but other backends have 60% CPU idle. ![image](https://user-images.githubusercontent.com/1249159/64230948-cefb7580-cf20-11e9-8a0d-977e1a7f106a.png) The test query likes "select day, sum(pv) from tbl where day = 20190904 group by day". After looking into backend's log, I found that all queries' second phase aggregation was processed in the exhausted backend. This is because Doris will assign the same backend for aggregation/join shuffle slot, this will cause compute skew when the query has smaller group by cardinality than number of backends.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@doris.apache.org For additional commands, e-mail: dev-h...@doris.apache.org