janhoy opened a new pull request, #3940:
URL: https://github.com/apache/solr/pull/3940

   This test has failed 8% of the time since start of December, see 
https://develocity.apache.org/scans/tests?search.rootProjectNames=solr-root&search.timeZoneId=Europe%2FOslo&tests.container=org.apache.solr.search.TestQueryLimits&tests.sortField=FLAKY&tests.test=testAdjustShardRequestLimits
   
   <img width="1257" height="373" alt="Skjermbilde 2025-12-10 kl  16 24 18" 
src="https://github.com/user-attachments/assets/ee701285-f5ef-42b1-85c4-90d1e6f496ce";
 />
   
   My AI buddy believes it fails due to a too low timeAllowed in the initial 
request:
   
   ## Root Cause
   
   The test was failing due to a timing race condition in distributed grouped 
queries:
   
   1. Test Setup: Uses timeAllowed=500ms with sleep=100ms per request
   2. Grouped Query Phases: Distributed grouped queries require 2 phases
   3. Actual Timing (from failed run logs):
       - Phase 1: ~100ms (matching sleep)
       - Phase 2: ~436ms (sleep + query processing, especially term iteration)
       - Total: ~536ms > 500ms timeout
   4. The Issue: The 500ms budget was barely sufficient:
       - 200ms consumed by sleep operations
       - Only 300ms margin for query processing, network overhead, and system 
variability
       - Phase 2 query processing (term iteration in SolrRangeQuery) is 
variable and can exceed this tight margin
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to