Pierre Salagnac created SOLR-17971:
--------------------------------------

             Summary: REBALANCELEADERS command is horribly slow with 
distributed cluster processing
                 Key: SOLR-17971
                 URL: https://issues.apache.org/jira/browse/SOLR-17971
             Project: Solr
          Issue Type: Improvement
    Affects Versions: 9.0
            Reporter: Pierre Salagnac


Internally, the {{REBALANCELEADERS}} command checks the results of leader 
re-assignment requests by looking at overseer result distributed maps.

This is done in {{{}RebalanceLeaders.waitAsyncRequests(){}}}.

When the cluster is configured to run distributed cluster processing (no 
overseer), results of leader re-assignment commands are not stored in these 
map. This makes the parent {{REBALANCELEADERS}} to wait for a long time and 
eventually timeout. Then it continues processing.

That's unclear to me how the commands actually worked, but it seems leaders 
were correctly rebalanced even in case of a timeout. So there is no big 
functional impact.

Instead of checking the overseer result maps, the parent command should check 
the requests status by invoking {{REQUESTSTATUS}} for each asynchronous ID.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to