Re: Dump snapshot of big table in real time using StreamingFileSink

2019-01-24 Thread knur
Bump? 🙏

Change Flink checkpoint configuration at runtime

2019-01-22 Thread knur
I'm running a streaming job that uses the following config:
checkpointInterval = 5 mins
minPauseBetweenCheckpoints = 2 mins
checkpointTimeout = 1 minute
maxConcurrentCheckpoints = 1
This is using incremental, async checkpoints with the RocksDB backend. So far around 2K checkpoints

Re: Dump snapshot of big table in real time using StreamingFileSink

2019-01-17 Thread knur
Hello Jamie. Thanks for taking a look at this. So, yes, I want to write only the last data for each key every X minutes. In other words, I want a snapshot of the whole database every X minutes.
> The issue is that the window never gets PURGED so the data just
> continues to accumulate in the wi

Dump snapshot of big table in real time using StreamingFileSink

2019-01-17 Thread knur
Hello there. So we have some Postgres tables that are mutable, and we want to create a snapshot of them in S3 every X minutes. So we plan to use Debezium to send a CDC log of every row change into a Kafka topic, and then have Flink keep the latest state of each row to save that data into S3 subseq
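To make the "keep the latest state of each row" idea concrete, here is a minimal plain-Java sketch of the upsert logic only. It is not Flink or Debezium code: the class name `LatestRowSnapshot`, the string-typed key and row, and the convention that a null row means "deleted" are all assumptions made for illustration. In a real job this state would presumably live in keyed state, and the snapshot would be written out by a sink every X minutes.

```java
import java.util.Collections;
import java.util.Map;
import java.util.TreeMap;

/**
 * Hypothetical in-memory model of "latest row per primary key".
 * Illustration only; none of these names come from Flink or Debezium.
 */
final class LatestRowSnapshot {
    // Latest known row for each primary key, sorted by key for stable output.
    private final Map<String, String> latest = new TreeMap<>();

    /**
     * Apply one change event from the CDC log.
     * Assumption: a null row stands for a delete of that key.
     */
    void apply(String key, String row) {
        if (row == null) {
            latest.remove(key);
        } else {
            latest.put(key, row); // insert or overwrite an older version
        }
    }

    /** Return a read-only copy of the current state, unaffected by later events. */
    Map<String, String> snapshot() {
        return Collections.unmodifiableMap(new TreeMap<>(latest));
    }
}
```

Calling `snapshot()` on a timer would give one full copy of the table per interval, which is the behaviour described above; the copy is what makes each dump independent of changes that arrive while it is being written.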