Hey all, We deploy application cluster natively on Kubernetes.
are there any timeouts for Job execution and cluster creation? I went over the configuration page here<https://ci.apache.org/projects/flink/flink-docs-stable/deployment/config.html> but did not find anything relevant. In order to get an indication about the cluster , we leverage the k8s client<https://github.com/fabric8io/kubernetes-client/blob/master/doc/CHEATSHEET.md#pods> to watch the deployment<https://github.com/fabric8io/kubernetes-client/blob/master/doc/CHEATSHEET.md#deployment#:~:text=Watching%20a%20Deployment%3A> in a namespace with specific cluster name and respond accordingly. we define two timeouts 1. Creating the application cluster (i.e. to date if there are errors in pods, the k8s deployment is up but the application cluster is not running.) 2. Until the application cluster resources get cleaned(upon completion) - which prevent an infinite job execution or k8s glitches However, this solution is not ideal because in case this client lib crashes, the timeouts are gone. We don't want to manage these timeouts states ourselves. Any suggestion or better way? Thanks, Tamir. [https://my-email-signature.link/signature.gif?u=1088647&e=145346582&v=3f32b726c93b8d93869d4a1520a346f1c12902a66bd38eb48abc091003335147] Confidentiality: This communication and any attachments are intended for the above-named persons only and may be confidential and/or legally privileged. Any opinions expressed in this communication are not necessarily those of NICE Actimize. If this communication has come to you in error you must take no action based on it, nor must you copy or show it to anyone; please delete/destroy and inform the sender by e-mail immediately. Monitoring: NICE Actimize may monitor incoming and outgoing e-mails. Viruses: Although we have taken steps toward ensuring that this e-mail and attachments are free from any virus, we advise that in keeping with good computing practice the recipient should ensure they are actually virus free.