Re: Is chk-$id/_metadata created regardless of enabling externalized checkpoints?

2021-01-04 Thread Yun Gao
n as follows in Web UI (note that "Persist Checkpoints Externally" is "Disabled" in the final row): According to [2], externalized checkpoints: You can configure periodic checkpoints to be persisted externally. Externalized checkpoints write their meta data out to persisten

Re: Externalized checkpoints

2019-08-25 Thread Vishwas Siravara
Got it.Thank you On Thu, Aug 22, 2019 at 8:54 PM Congxian Qiu wrote: > Hi, Vishwas > > As Zhu Zhu said, you can set "state.checkpoints.num-retained"[1] to > specify the maximum number of completed checkpoints to retain. > maybe you can also ref the external checkpoint cleanup type[2] config for

Re: Externalized checkpoints

2019-08-22 Thread Congxian Qiu
Hi, Vishwas As Zhu Zhu said, you can set "state.checkpoints.num-retained"[1] to specify the maximum number of completed checkpoints to retain. maybe you can also ref the external checkpoint cleanup type[2] config for how to clean up the retained checkpoint[2] [1] https://ci.apache.org/projects/fl

Re: Externalized checkpoints

2019-08-21 Thread Zhu Zhu
Hi Vishwas, You can configure "state.checkpoints.num-retained" to specify the max checkpoints to retain. By default it is 1. Thanks, Zhu Zhu Vishwas Siravara 于2019年8月22日周四 上午6:48写道: > I am also using exactly once checkpointing mode, I have a kafka source and > sink so both support transactions

Re: Externalized checkpoints

2019-08-21 Thread Vishwas Siravara
I am also using exactly once checkpointing mode, I have a kafka source and sink so both support transactions which should allow for exactly once processing. Is this the reason why there is only one checkpoint retained ? Thanks, Vishwas On Wed, Aug 21, 2019 at 5:26 PM Vishwas Siravara wrote: > H

Externalized checkpoints

2019-08-21 Thread Vishwas Siravara
Hi peeps, I am externalizing checkpoints in S3 for my flink job and I retain them on cancellation. However when I look into my S3 bucket where the checkpoints are stored there is only 1 checkpoint at any point in time . Is this the default behavior of flink where older checkpoints are deleted when

Re: Externalized checkpoints and metadata

2018-04-26 Thread 周思华
10%3A+Unify+Checkpoints+and+Savepoints ? Thank you, Juan G. From: hao gao Date: Wednesday, 25 April 2018 at 20:25 To: Juan Gentile Cc: "user@flink.apache.org" , Oleksandr Nitavskyi Subject: Re: Externalized checkpoints and metadata Hi Juan, We modified the flink code

Re: Externalized checkpoints and metadata

2018-04-26 Thread Juan Gentile
at 20:25 To: Juan Gentile Cc: "user@flink.apache.org" , Oleksandr Nitavskyi Subject: Re: Externalized checkpoints and metadata Hi Juan, We modified the flink code a little bit to change the flink checkpoint structure so we can easily identify which is which you can read my note or th

Re: Externalized checkpoints and metadata

2018-04-25 Thread hao gao
helps Thanks Hao 2018-04-25 6:07 GMT-07:00 Juan Gentile : > Hello, > > > > We are trying to use externalized checkpoints, using RocksDB on Hadoop > hdfs. > > We would like to know what is the proper way to resume from a saved > checkpoint as we are currently running

Externalized checkpoints and metadata

2018-04-25 Thread Juan Gentile
Hello, We are trying to use externalized checkpoints, using RocksDB on Hadoop hdfs. We would like to know what is the proper way to resume from a saved checkpoint as we are currently running many jobs in the same flink cluster. The problem is that when we want to restart the jobs and pass the

Two questions on Flink externalized checkpoints

2018-04-07 Thread James Yu
I left a question on stackoverflow ( https://stackoverflow.com/questions/49712817/two-questions-on-flink-externalized-checkpoints) and ask again in mail-list in case you check mail more often. I have two questions on Flink externalized checkpoints (Q1) I can set "state.checkpoints.dir"

Re: Duplicated data when using Externalized Checkpoints in a Flink Highly Available cluster

2017-06-08 Thread Nico Kruber
roduce duplicates? Is > it because Kafka guarantees only At-least once delivery at the moment? > > Thanks, > Amara > > > > -- > View this message in context: > http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Duplica > ted-data-when-using-Externali

Re: Duplicated data when using Externalized Checkpoints in a Flink Highly Available cluster

2017-06-05 Thread F.Amara
FlinkKafkaConsumer onwards. Why does this component introduce duplicates? Is it because Kafka guarantees only At-least once delivery at the moment? Thanks, Amara -- View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Duplicated-data-when-using-Externalized

Re: Duplicated data when using Externalized Checkpoints in a Flink Highly Available cluster

2017-06-04 Thread F.Amara
Kafka) | | | | | |--| |---| || Thanks, Amara -- View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Duplicated-data-when-using-Externalized-Checkpoints-in-a-Flink-Highly-Available-cluster-tp13301p13481.html Sen

Re: Duplicated data when using Externalized Checkpoints in a Flink Highly Available cluster

2017-06-01 Thread Robert Metzger
processing for my application? Could you > please guide me on what I might have missed? > > Thanks, > Amara > > > > > -- > View this message in context: http://apache-flink-user- > mailing-list-archive.2336050.n4.nabble.com/Duplicated-data- > when-using-Externalized

Re: Duplicated data when using Externalized Checkpoints in a Flink Highly Available cluster

2017-05-30 Thread F.Amara
? Could you please guide me on what I might have missed? Thanks, Amara -- View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Duplicated-data-when-using-Externalized-Checkpoints-in-a-Flink-Highly-Available-cluster-tp13301p13379.html Sent from the Apache

Re: Duplicated data when using Externalized Checkpoints in a Flink Highly Available cluster

2017-05-26 Thread Tzu-Li (Gordon) Tai
flink-user-mailing-list-archive.2336050.n4.nabble.com/Duplicated-data-when-using-Externalized-Checkpoints-in-a-Flink-Highly-Available-cluster-tp13301.html Sent from the Apache Flink User Mailing List archive. mailing list archive at Nabble.com.

Duplicated data when using Externalized Checkpoints in a Flink Highly Available cluster

2017-05-24 Thread F.Amara
hanks, Amara -- View this message in context: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/Duplicated-data-when-using-Externalized-Checkpoints-in-a-Flink-Highly-Available-cluster-tp13301.html Sent from the Apache Flink User Mailing List archive. mailing list archive at Nabble.com.

Re: Externalized Checkpoints vs Periodic Checkpoints

2017-02-02 Thread Till Rohrmann
oints are externalized checkpoints with some more properties. At the moment, Flink's checkpoint coordinator only retains the last successfully completed checkpoint. This means that whenever a new checkpoint completes then the last completed checkpoint will be discarded. This also applies to ex

Re: Externalized Checkpoints vs Periodic Checkpoints

2017-02-02 Thread Yassine MARZOUGUI
in memory. The former has the advantage that you don't > lose the checkpoints if you shutdown your cluster. They behave similar to > savepoints and in fact savepoints are externalized checkpoints with some > more properties. > > At the moment, Flink's checkpoint coo

Externalized Checkpoints vs Periodic Checkpoints

2017-02-01 Thread Yassine MARZOUGUI
Hi all, Could someone clarify the difference between externalized checkpoints[1] and regular periodic checkpoints[2]? Moreover, I have a question regarding the retention of checkpoints: For regular checkpoints, does the last checkpoint discard the previous ones? If yes, is that the case too for

Re: Why are externalized checkpoints deleted on Job Manager exit?

2016-11-15 Thread Cliff Resnick
. > > On Thu, Nov 3, 2016 at 3:23 PM, Ufuk Celebi wrote: > > I don't need the logs. Externalized checkpoints have been configured > > to be deleted when the job is suspended, too. When the YARN session is > > terminated, all jobs are suspended. > > > > The

Re: Why are externalized checkpoints deleted on Job Manager exit?

2016-11-03 Thread Ufuk Celebi
A fix is pending here: https://github.com/apache/flink/pull/2750 The behaviour on graceful shut down/suspension respects the cancellation behaviour with this change. On Thu, Nov 3, 2016 at 3:23 PM, Ufuk Celebi wrote: > I don't need the logs. Externalized checkpoints have been configured

Re: Why are externalized checkpoints deleted on Job Manager exit?

2016-11-03 Thread Ufuk Celebi
I don't need the logs. Externalized checkpoints have been configured to be deleted when the job is suspended, too. When the YARN session is terminated, all jobs are suspended. The behaviour seems like a bug. As a work around you have to cancel the job before you shut down the YARN session. L

Re: Why are externalized checkpoints deleted on Job Manager exit?

2016-11-03 Thread Ufuk Celebi
be going on here? > > > On Wed, 2 Nov 2016 at 21:52 Clifford Resnick wrote: >> >> Testing externalized checkpoints in a YARN-based cluster, configured with: >> >> >> env.getCheckpointConfig.enableExternalizedCheckpoints(ExternalizedCheckpointCleanup.RETAIN_

Re: Why are externalized checkpoints deleted on Job Manager exit?

2016-11-03 Thread Aljoscha Krettek
+Ufuk Ufuk recently worked on that, if I'm not mistaken. Do you have an Idea what could be going on here? On Wed, 2 Nov 2016 at 21:52 Clifford Resnick wrote: > Testing externalized checkpoints in a YARN-based cluster, configu

Why are externalized checkpoints deleted on Job Manager exit?

2016-11-02 Thread Clifford Resnick
Testing externalized checkpoints in a YARN-based cluster, configured with: env.getCheckpointConfig.enableExternalizedCheckpoints(ExternalizedCheckpointCleanup.RETAIN_ON_CANCELLATION); I can confirm that checkpoint is retained between cancelled jobs, however it’s deleted when the Job Manager