Slow GroupAggregate and Sort

Darwin Correa Thu, 28 Dec 2023 09:03:18 -0800

Hello, and good day. We have a strange case with a slow query and would like 
some help.




I've already read the article 
https://wiki.postgresql.org/wiki/Slow_Query_Questions



Explain: https://paste.depesz.com/s/PLP

Explain2: https://explain-postgresql.com/archive/explain/8e4b573c5f7bcf3a0d30675a430051fd:0:2023-12-26

Query: https://paste.depesz.com/s/fd3

DDL: https://paste.depesz.com/s/vBW

Tuning: https://paste.depesz.com/s/dXa







We have a Citus cluster with the following configuration: 1 master + 3 data 
nodes. Each machine has:

- 24 cores (Intel Xeon E5 2620)

- 192 GB RAM

- 1TB SSD



Each node has its Postgres settings configured using tuning.sql.

The main table DDL is in ddl.sql.

The tables are distributed as follows:



SELECT create_distributed_table('salert_post', 'id', shard_count := 72);

SELECT create_distributed_table('salert_q56', 'post',
                                colocate_with => 'salert_post');



When we run the query (query.sql), the explain output (plan4_v3.txt) shows that 
Citus takes about 18s to run all fragments, even though each fragment takes at 
most 2s. So my questions are:

- Why does Citus take this long to run all fragments?

- If each Postgres node is tuned efficiently, why does it take so long to sort 
and aggregate the Citus results?
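
To see where the time goes across fragments, one option is to have Citus print every task in the plan instead of a single sample. This is only a sketch: it assumes the citus.explain_all_tasks setting is available in the installed Citus version, and the SELECT is a stand-in, not the real statement from query.sql.

```sql
-- Show all tasks in the EXPLAIN output rather than one representative task.
SET citus.explain_all_tasks = on;

-- Stand-in query (assumption): replace it with the statement from query.sql.
EXPLAIN (ANALYZE, VERBOSE)
SELECT post, count(*)
FROM salert_q56
GROUP BY post;

-- Return to the default afterwards.
RESET citus.explain_all_tasks;
```

Comparing the per-task execution times with the total time of the Custom Scan node may show whether the gap comes from tasks not all running at once or from moving the results to the master.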



Good night. I hope you can help me with some ideas.





We also removed the partitions and tested with Citus only, but the query took 
more than a minute.

As a note, we do not have 72 shards on the same node. We have 72 in total, 24 
shards on each node.
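
A quick way to confirm this layout could look like the following. It assumes Citus 10 or later, where the citus_shards view exists; older versions would need the pg_dist_* metadata tables instead.

```sql
-- Count the shards of salert_post placed on each node.
SELECT nodename, count(*) AS shard_count
FROM citus_shards
WHERE table_name = 'salert_post'::regclass
GROUP BY nodename
ORDER BY nodename;
```

With the setup described here, each of the 3 data nodes would be expected to report 24 shards.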



I think the problem is in the Sort and the GroupAggregate. I have no idea how 
to speed these up on the master node. The Custom Scan (Citus Adaptive) is not 
too slow; most of the time is spent on the master in the Sort and the grouping.
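
One thing that can be checked on the master is whether the Sort spills to disk. This is a minimal sketch for a test session only: the 256MB value and the SELECT are illustrative assumptions, not taken from tuning.sql or query.sql. The stand-in query has to be replaced by the real statement for the Sort and GroupAggregate nodes to show up.

```sql
-- Current per-operation sort/hash memory limit for this session.
SHOW work_mem;

-- Illustrative value (assumption); affects only the current session.
SET work_mem = '256MB';

-- Stand-in query (assumption): replace it with the statement from query.sql.
EXPLAIN (ANALYZE, BUFFERS)
SELECT post, count(*)
FROM salert_q56
GROUP BY post;

-- Restore the configured value.
RESET work_mem;
```

In the EXPLAIN ANALYZE output, a Sort Method of "external merge" means the sort used temporary files on disk, while "quicksort" means it fit in memory.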



I hope you can help me.
