[DISCUSS] - Propagation of Startup Errors

2025-01-06 Thread Luca Castelli
FLINK-34991: When there is a code error that crashes a job manager before the DAG is built, the real error (e.g. ClassNotFoundException) is not propagated correctly. Instead, the CRD shows: "HA metadata not available to restore from last state. I

[DISCUSS] Operator 1.10 - error with FINISHED jobs that lead to no clean-up

2025-02-03 Thread Luca Castelli
Hello, We're experiencing an issue with operator 1.10 where: 1. A batch job will successfully finish 2. The operator throws an error because it can't find the job ID 3. The job doesn't get properly cleaned up and stays stuck in RUNNING state 4. As a result, the Flink UI and JMs rem

[jira] [Created] (FLINK-37320) FINISHED jobs incorrectly being set to RECONCILING

2025-02-13 Thread Luca Castelli (Jira)
Luca Castelli created FLINK-37320. Summary: FINISHED jobs incorrectly being set to RECONCILING Key: FLINK-37320 URL: https://issues.apache.org/jira/browse/FLINK-37320 Project: Flink Issue

[jira] [Created] (FLINK-37370) [Observer] Finished batch jobs throw ReconciliationException and never reach FINISHED in the CR

2025-02-21 Thread Luca Castelli (Jira)
Luca Castelli created FLINK-37370. Summary: [Observer] Finished batch jobs throw ReconciliationException and never reach FINISHED in the CR Key: FLINK-37370 URL: https://issues.apache.org/jira/browse/FLINK-37370