alamb commented on issue #15323:
URL: https://github.com/apache/datafusion/issues/15323#issuecomment-2781577131

   > even if you use global tokio runtime and set the number of blocking
   > threads to be a 1000 for example, there can be 1001 spill files. the problem
   > is the same
   
   At some point the system is going to be I/O bound, so having more blocking
   threads doing I/O isn't going to help throughput and will likely consume
   non-trivial time context switching between them.
   
   I think a better solution is to more carefully manage how many files are
   being spilled / read at any one time. This will be more complicated (we'll
   likely have to do multiple merge phases, etc.), but I think it is the better
   approach in the long run.
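
   To make the "multiple merge phases" idea concrete, here is a minimal sketch
   of a merge with a bounded fan-in. It is not DataFusion code: in-memory
   `Vec<i32>` runs stand in for sorted spill files, and the names `merge_runs`,
   `multi_phase_merge` and `max_fan_in` are hypothetical. The point is only
   that no phase ever touches more than `max_fan_in` runs at once, however many
   runs exist in total.

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

/// Merge a group of sorted runs into one sorted run using a k-way heap merge.
/// Each `Vec<i32>` is a stand-in for one sorted spill file (an assumption
/// made for illustration only).
fn merge_runs(runs: &[Vec<i32>]) -> Vec<i32> {
    // Min-heap of (value, run index, position within that run).
    let mut heap = BinaryHeap::new();
    for (run_idx, run) in runs.iter().enumerate() {
        if let Some(&first) = run.first() {
            heap.push(Reverse((first, run_idx, 0usize)));
        }
    }

    let mut out = Vec::with_capacity(runs.iter().map(Vec::len).sum());
    while let Some(Reverse((value, run_idx, pos))) = heap.pop() {
        out.push(value);
        // Refill the heap from the run we just consumed from.
        if let Some(&next) = runs[run_idx].get(pos + 1) {
            heap.push(Reverse((next, run_idx, pos + 1)));
        }
    }
    out
}

/// Repeatedly merge at most `max_fan_in` runs at a time until one run is left.
/// Returns the fully merged run and the number of merge phases it took.
fn multi_phase_merge(mut runs: Vec<Vec<i32>>, max_fan_in: usize) -> (Vec<i32>, usize) {
    assert!(max_fan_in >= 2, "a fan-in of at least 2 is needed to make progress");
    let mut phases = 0;
    while runs.len() > 1 {
        // Each chunk is merged on its own, so at most `max_fan_in` runs are
        // open for reading at any moment.
        runs = runs.chunks(max_fan_in).map(merge_runs).collect();
        phases += 1;
    }
    (runs.pop().unwrap_or_default(), phases)
}
```

   With 1001 runs and `max_fan_in = 16`, for example, the run count shrinks
   from 1001 to 63 to 4 to 1. The cost is that intermediate results are
   rewritten once per phase, which is the extra complexity mentioned above.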


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

