zyhui98 opened a new issue, #9814:
URL: https://github.com/apache/seatunnel/issues/9814

   ### Search before asking
   
   - [x] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues.
   
   
   ### What happened
   
   JDK 8
   SeaTunnel 2.3.11
   
   ----
   There are 100+ jobs. After the master node is restarted, some jobs end up in a pending or unknown state.
   
   
   ### SeaTunnel Version
   
   2.3.11
   
   ### SeaTunnel Config
   
   ```conf
   hazelcast.partition.count: 1
   hazelcast.invocation.retry.pause.millis: 500
   hazelcast.integrity-checker.enabled: true
   hazelcast.invocation.max.retry.count: 10
   hazelcast.operation.call.timeout.millis: 25000
   hazelcast.operation.backup.timeout.millis: 25000
   hazelcast.tcp.join.port.try.count: 30
   hazelcast.logging.type: log4j2
   hazelcast.operation.generic.thread.count: 70
   hazelcast.heartbeat.failuredetector.type: phi-accrual
   hazelcast.heartbeat.interval.seconds: 2
   hazelcast.max.no.heartbeat.seconds: 180
   hazelcast.heartbeat.phiaccrual.failuredetector.threshold: 10
   hazelcast.heartbeat.phiaccrual.failuredetector.sample.size: 200
   hazelcast.heartbeat.phiaccrual.failuredetector.min.std.dev.millis: 100
   ```
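   
   For context on the two invocation retry settings above, here is a rough sketch of the longest time a single invocation could spend pausing between retries. It is not SeaTunnel or Hazelcast code: `RetryBudget` and `upperBound` are hypothetical names. It assumes every retry waits the full `hazelcast.invocation.retry.pause.millis`, so the result is only an upper bound, and it takes 250 and 500 as the Hazelcast defaults for these two properties. Whether this window is related to the pending/unknown jobs has not been confirmed.
   
   ```java
   import java.time.Duration;
   
   public final class RetryBudget {
       private RetryBudget() {}
   
       /**
        * Upper bound on the time spent pausing between retries,
        * assuming (hypothetically) a full pause before every retry.
        */
       public static Duration upperBound(int maxRetryCount, long retryPauseMillis) {
           return Duration.ofMillis(Math.multiplyExact((long) maxRetryCount, retryPauseMillis));
       }
   
       public static void main(String[] args) {
           // Values copied from the config in this report.
           Duration configured = upperBound(10, 500);
           // Assumed Hazelcast defaults for the same two properties.
           Duration defaults = upperBound(250, 500);
           System.out.println("configured: " + configured.toMillis() + " ms");
           System.out.println("defaults:   " + defaults.toMillis() + " ms");
       }
   }
   ```
   
   With the values in this report the bound is 5000 ms, against 125000 ms under the assumed defaults, so a partition migration that outlasts a few seconds could exhaust the retries.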
   
   ### Running Command
   
   ```shell
   no
   ```
   
   ### Error Exception
   
   ```log
   com.hazelcast.spi.exception.PartitionMigratingException: Partition is migrating! this: [seatunnel-fat-master2]:5801, partitionId: 0, operation: com.hazelcast.map.impl.operation.SetOperation, service: hz:impl:mapService
   at com.hazelcast.spi.impl.operationservice.impl.OperationRunnerImpl.ensureNoPartitionProblems(OperationRunnerImpl.java:385)
   at com.hazelcast.spi.impl.operationservice.impl.OperationRunnerImpl.run(OperationRunnerImpl.java:241)
   at com.hazelcast.spi.impl.operationservice.impl.OperationRunnerImpl.run(OperationRunnerImpl.java:213)
   at com.hazelcast.spi.impl.operationexecutor.impl.OperationThread.process(OperationThread.java:175)
   at com.hazelcast.spi.impl.operationexecutor.impl.OperationThread.process(OperationThread.java:139)
   at com.hazelcast.spi.impl.operationexecutor.impl.OperationThread.executeRun(OperationThread.java:123)
   at com.hazelcast.internal.util.executor.HazelcastManagedThread.run(HazelcastManagedThread.java:102)
   at ------ submitted from ------.()
   at com.hazelcast.internal.util.ExceptionUtil.cloneExceptionWithFixedAsyncStackTrace(ExceptionUtil.java:336)
   at com.hazelcast.spi.impl.operationservice.impl.InvocationFuture.returnOrThrowWithGetConventions(InvocationFuture.java:112)
   at com.hazelcast.spi.impl.operationservice.impl.InvocationFuture.resolveAndThrowIfException(InvocationFuture.java:100)
   at com.hazelcast.spi.impl.AbstractInvocationFuture.get(AbstractInvocationFuture.java:617)
   at com.hazelcast.map.impl.proxy.MapProxySupport.invokeOperation(MapProxySupport.java:479)
   at com.hazelcast.map.impl.proxy.MapProxySupport.setInternal(MapProxySupport.java:575)
   at com.hazelcast.map.impl.proxy.MapProxyImpl.set(MapProxyImpl.java:252)
   at com.hazelcast.map.impl.proxy.MapProxyImpl.set(MapProxyImpl.java:242)
   at org.apache.seatunnel.engine.server.dag.physical.SubPlan.updateStateTimestamps(SubPlan.java:422)
   at org.apache.seatunnel.engine.server.dag.physical.SubPlan.lambda$updatePipelineState$7(SubPlan.java:369)
   at org.apache.seatunnel.common.utils.RetryUtils.retryWithException(RetryUtils.java:48)
   at org.apache.seatunnel.engine.server.dag.physical.SubPlan.updatePipelineState(SubPlan.java:367)
   at org.apache.seatunnel.engine.server.dag.physical.SubPlan.makePipelineFailing(SubPlan.java:732)
   at org.apache.seatunnel.engine.server.dag.physical.SubPlan.updatePipelineState(SubPlan.java:387)
   at org.apache.seatunnel.engine.server.dag.physical.SubPlan.handleCheckpointError(SubPlan.java:612)
   at org.apache.seatunnel.engine.server.master.JobMaster.lambda$handleCheckpointError$8(JobMaster.java:593)
   at java.util.ArrayList.forEach(ArrayList.java:1249)
   at org.apache.seatunnel.engine.server.master.JobMaster.handleCheckpointError(JobMaster.java:590)
   at org.apache.seatunnel.engine.server.checkpoint.CheckpointManager.handleCheckpointError(CheckpointManager.java:174)
   at org.apache.seatunnel.engine.server.checkpoint.CheckpointCoordinator.handleCoordinatorError(CheckpointCoordinator.java:292)
   at org.apache.seatunnel.engine.server.checkpoint.CheckpointCoordinator.handleCoordinatorError(CheckpointCoordinator.java:278)
   at org.apache.seatunnel.engine.server.checkpoint.CheckpointCoordinator.lambda$createPendingCheckpoint$11(CheckpointCoordinator.java:714)
   at java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1590)
   at org.apache.seatunnel.api.tracing.MDCRunnable.run(MDCRunnable.java:43)
   at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
   at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
   at java.lang.Thread.run(Thread.java:745)
   ```
   
   ### Zeta or Flink or Spark Version
   
   _No response_
   
   ### Java or Scala Version
   
   _No response_
   
   ### Screenshots
   
   _No response_
   
   ### Are you willing to submit PR?
   
   - [ ] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [x] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
   

