[ https://issues.apache.org/jira/browse/IGNITE-24942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Roman Puchkovskiy updated IGNITE-24942: --------------------------------------- Description: PartitionMover makes a retry on an exception. Retries are made on each exception (including those that are not retriable), there is no retry limit and the retries might happen in the same thread, which sometimes leads to an infinite loop (resulting in StackOverflowException) is something is broken. # We need to differentiate which exceptions are retryable and which are not # For non-retryable ones, we should call FailureManager right away and stop retrying # For retryable ones, we should add a retry counter and stop handling an exception is a retryable when the counter reaches some limit (that is, stop retrying and notify FailureManager) > StackOverflowError in PartitionMover > ------------------------------------ > > Key: IGNITE-24942 > URL: https://issues.apache.org/jira/browse/IGNITE-24942 > Project: Ignite > Issue Type: Improvement > Reporter: Roman Puchkovskiy > Priority: Major > Labels: ignite-3 > > PartitionMover makes a retry on an exception. Retries are made on each > exception (including those that are not retriable), there is no retry limit > and the retries might happen in the same thread, which sometimes leads to an > infinite loop (resulting in StackOverflowException) is something is broken. > # We need to differentiate which exceptions are retryable and which are not > # For non-retryable ones, we should call FailureManager right away and stop > retrying > # For retryable ones, we should add a retry counter and stop handling an > exception is a retryable when the counter reaches some limit (that is, stop > retrying and notify FailureManager) -- This message was sent by Atlassian Jira (v8.20.10#820010)