Hi Jan, Thanks for you valuable response. Actually, both standalone and native K8s integration, we could have two ways to achieve the HA. * StatefulSet + FilesystemHAService(with leader election/retrieval removed)[1] * Deployment + NativeK8sHaService(native K8s leader election and config map to store meta info)[2]
And we already have some tickets for them. Hope we could make a progress in the next release cycle. [1]. https://issues.apache.org/jira/browse/FLINK-17598 [2]. https://issues.apache.org/jira/browse/FLINK-12884 Best, Yang Alexey Trenikhun <yen...@msn.com> 于2020年8月15日周六 下午12:26写道: > Thank you Arvid and Yang! > > ------------------------------ > *From:* Yang Wang <danrtsey...@gmail.com> > *Sent:* Thursday, August 13, 2020 8:09:13 PM > *To:* Arvid Heise <ar...@ververica.com> > *Cc:* Alexey Trenikhun <yen...@msn.com>; user <user@flink.apache.org> > *Subject:* Re: k8s job cluster using StatefulSet > > Hi Alexey, > > Actually, StatefulSets could also be used to start the JobManager and > TaskManager. > > So why do we suggest to use Deployment in the Flink documentation? > * StatefulSets requires the user to have persistent volume in the K8s > cluster. However, it is not always true, > especially for the unmanaged(self-build) K8s cluster. > * Flink uses Zookeeper and distributed storage(S3, GFS, etc.) to process > the fault tolerance. If you start multiple > JobManagers, the leader election and leader retrieval will be done via > Zookeeper. Also the meta information will > be stored in the Zookeeper. So it is unnecessary to use StatefulSet to > do more things. > * The local data of JobManager and TaskManager is ephemeral. It could be > discarded after crashed. > > > Best, > Yang > > > > > Arvid Heise <ar...@ververica.com> 于2020年8月13日周四 下午4:38写道: > > Hi Alexey, > > I don't see any issue in using stateful sets immediately. > > I'd recommend using one of the K8s operators or Ververica's community > edition [1] though if you start with a new setup as they may solve even > more issues that you might experience in the future. > > [1] https://www.ververica.com/getting-started > > On Mon, Aug 10, 2020 at 11:22 PM Alexey Trenikhun <yen...@msn.com> wrote: > > Hello, > Flink documentation suggests to use Deployments to deploy JM and TM for > kubernetes job cluster. Is any known potential issues with using > StatefulSets instead, seems StatefullSet provides uniqueness for JM during > upgrade/rollback, while with Deployments could be multiple JM pods (e.g.1 > terminating and 1 running) > > Thanks, > Alexey > > > > -- > > Arvid Heise | Senior Java Developer > > <https://www.ververica.com/> > > Follow us @VervericaData > > -- > > Join Flink Forward <https://flink-forward.org/> - The Apache Flink > Conference > > Stream Processing | Event Driven | Real Time > > -- > > Ververica GmbH | Invalidenstrasse 115, 10115 Berlin, Germany > > -- > Ververica GmbH > Registered at Amtsgericht Charlottenburg: HRB 158244 B > Managing Directors: Timothy Alexander Steinert, Yip Park Tung Jason, Ji > (Toni) Cheng > >