From: Yang Wang
Sent: Tuesday, March 23, 2021 11:17:18 PM
To: Alexey Trenikhun
Cc: Flink User Mail List
Subject: Re: Kubernetes HA - attempting to restore from wrong (non-existing)
savepoint
Hi Alexey,
>From your attached logs, I do not think the new start JobManager will recover
&g
; restarted on cancel, did not grab log at that time, but chances good that I
> will able to reproduce.
> Thanks,
> Alexey
>
> ------
> *From:* Yang Wang
> *Sent:* Sunday, March 14, 2021 7:50:21 PM
> *To:* Alexey Trenikhun
> *Cc:* Flink User M
2021 7:50:21 PM
> *To:* Alexey Trenikhun
> *Cc:* Flink User Mail List
> *Subject:* Re: Kubernetes HA - attempting to restore from wrong
> (non-existing) savepoint
>
> If the HA related ConfigMaps still exists, then I am afraid the data
> located on the distributed stora
From: Yang Wang
Sent: Sunday, March 14, 2021 7:50:21 PM
To: Alexey Trenikhun
Cc: Flink User Mail List
Subject: Re: Kubernetes HA - attempting to restore from wrong (non-existing)
savepoint
If the HA related ConfigMaps still exists, then I am afraid the data located on
the distributed storage
>
> Thanks,
> Alexey
> --
> *From:* Yang Wang
> *Sent:* Thursday, March 11, 2021 2:59 AM
> *To:* Alexey Trenikhun
> *Cc:* Flink User Mail List
> *Subject:* Re: Kubernetes HA - attempting to restore from wrong
> (non-existing) savepoint
>
> Hi Alexey,
>
> F
ary 28, 2021 10:04 PM
To: Alexey Trenikhun mailto:yen...@msn.com>>
Cc: Flink User Mail List mailto:user@flink.apache.org>>
Subject: Re: Kubernetes HA - attempting to restore from wrong (non-existing)
savepoint
Hi Alexey,
It seems that the KubernetesHAService works well since all the
List
> *Subject:* Re: Kubernetes HA - attempting to restore from wrong
> (non-existing) savepoint
>
> Hi Alexey,
>
> It seems that the KubernetesHAService works well since all the checkpoints
> have been cleaned up when the job is canceled.
> And we could find relat
Hi Yang,
The problem is re-occurred, full JM log is attached
Thanks,
Alexey
From: Yang Wang
Sent: Sunday, February 28, 2021 10:04 PM
To: Alexey Trenikhun
Cc: Flink User Mail List
Subject: Re: Kubernetes HA - attempting to restore from wrong (non-existing
From: Yang Wang
Sent: Sunday, February 28, 2021 10:04 PM
To: Alexey Trenikhun
Cc: Flink User Mail List
Subject: Re: Kubernetes HA - attempting to restore from wrong (non-existing)
savepoint
Hi Alexey,
It seems that the KubernetesHAService works well since all the checkpoints have
been cleaned u
Hi Alexey,
It seems that the KubernetesHAService works well since all the checkpoints
have been cleaned up when the job is canceled.
And we could find related logs "Found 0 checkpoints in
KubernetesStateHandleStore{configMapName='gsp--jobmanager-leader'}.".
However
Hello,
We have Flink job running in Kubernetes with Kuberenetes HA enabled (JM is
deployed as Job, single TM as StatefulSet). We taken savepoint with
cancel=true. Now when we are trying to start job using --fromSavepoint A, where
is A path we got from taking savepoint (ClusterEntrypoint reports
11 matches
Mail list logo