xinyiZzz commented on pull request #8695: URL: https://github.com/apache/incubator-doris/pull/8695#issuecomment-1080746904
> Creating a hash table is expensive when the hash table has to expand. If we need an accurate cost model, it can't include only the network overhead.

You are right, constructing the HashTable is really expensive. I have also analyzed this before: https://shimo.im/docs/yKvCqD68kDWKXRgT

Do you have any experience with the HashTable size at which Broadcast is likely to become slower than Shuffle? I am curious about this.