2010YOUY01 opened a new issue, #15553:
URL: https://github.com/apache/datafusion/issues/15553

   ### Is your feature request related to a problem or challenge?
   
   During external queries, temporary computation results can be spilled to 
disk, to let the queries to complete under limited memory.
   A new configuration to limit the maximum total spilled file disk usage is 
being added in https://github.com/apache/datafusion/pull/15520
   
   ### Describe the solution you'd like
   
   Adding a configuration to `datafusion-cli` like
   ```
   # By default, disk usage is not limited
   datafusion-cli -c 'select 1, 2 from foo';
   
   # Limit disk usage to 10GB
   datafusion-cli --disk-limit 10G -c 'select 1, 2 from foo'; 
   ```
   
   Here is a reference implementation for a very similar feature (adding 
configuration for memory pool) https://github.com/apache/datafusion/issues/7419
   
   ### Describe alternatives you've considered
   
   This solution is a little bit hack, in the long term it should better be 
configurable through SQL interface like
   ```
   set datafusion.runtime.disk_limit = 1GB;
   set datafusion.runtime.memory_limit = 100MB;
   
   select * from tbl order by c1;
   ```
   Tracking issue is in https://github.com/apache/datafusion/issues/15552
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to