Re: [EXTERNAL] Re: Native K8S Jobmanager restarts and job never recovers

2020-11-01 Thread Yang Wang
client/pull/1800> > > > > > > > > > Best,> > > > Yang> > > > > > > Bohinski, Kevin 于2020年8月14日周五 上午6:32写道:> > > > > > > > Might be useful> > > > >> > > > > https://stackoverflow.com/a/6

Re: [EXTERNAL] Re: Native K8S Jobmanager restarts and job never recovers

2020-10-28 Thread Bohinski, Kevin
> > *From: *"Bohinski, Kevin" mailto:ke...@comcast.com>>> > > *Date: *Thursday, August 13, 2020 at 6:13 PM> > > *To: *Yang Wang mailto:da...@gmail.com>>> > > *Cc: *"user@flink.apache.org<mailto:user@flink.apache.org>" >

Re: [EXTERNAL] Re: Native K8S Jobmanager restarts and job never recovers

2020-08-13 Thread Yang Wang
or$Worker.run(ThreadPoolExecutor.java:624) > [?:1.8.0_262] > > at java.lang.Thread.run(Thread.java:748) [?:1.8.0_262] > > 2020-08-13 22:00:40,416 INFO > org.apache.flink.runtime.blob.BlobServer [] - Stopped > BLOB server at 0.0.0.0:6124 &g

Re: [EXTERNAL] Re: Native K8S Jobmanager restarts and job never recovers

2020-08-13 Thread Bohinski, Kevin
Might be useful https://stackoverflow.com/a/61437982 Best, kevin From: "Bohinski, Kevin" Date: Thursday, August 13, 2020 at 6:13 PM To: Yang Wang Cc: "user@flink.apache.org" Subject: Re: [EXTERNAL] Re: Native K8S Jobmanager restarts and job never recovers Hi G

Re: [EXTERNAL] Re: Native K8S Jobmanager restarts and job never recovers

2020-08-13 Thread Bohinski, Kevin
: Sunday, August 9, 2020 at 10:29 PM To: "Bohinski, Kevin" Cc: "user@flink.apache.org" Subject: [EXTERNAL] Re: Native K8S Jobmanager restarts and job never recovers Hi Kevin, I think you may not set the high availability configurations in your native K8s session. Currently, we on

Re: Native K8S Jobmanager restarts and job never recovers

2020-08-09 Thread Yang Wang
Hi Kevin, I think you may not set the high availability configurations in your native K8s session. Currently, we only support zookeeper HA, so you need to add the following configuration. After the HA is configured, the running job, checkpoint and other meta could be stored. When the jobmanager fa

Native K8S Jobmanager restarts and job never recovers

2020-08-07 Thread Bohinski, Kevin
Hi all, In our 1.11.1 native k8s session after we submit a job it will run successfully for a few hours then fail when the jobmanager pod restarts. Relevant logs after restart are attached below. Any suggestions? Best kevin 2020-08-06 21:50:24,425 INFO org.apache.flink.kubernetes.KubernetesR