alamb commented on PR #14286:
URL: https://github.com/apache/datafusion/pull/14286#issuecomment-2707200014

   > > My other theory was that DataFusion might start requests but not consume 
them in time, thus leading to timeout errors even though the response was 
ready. However, I was not able to find any evidence that this was actually 
happening.
   > 
   > I'd expect precisely something like this if CPU is blocking IO tasks 
because the IO struggles to make progress and eventually times out. Couple that 
with retries and you can end up in scenarios where it's hard to tell what's 
causing what.
   
   One thing that I noticed is that DataFusion uses `ObjectStore::get_ranges`,
which returns buffered `Bytes`, so from the network perspective I think the
data is being consumed 🤔


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

