Re: RocksDB savepoint recovery performance improvements

2020-05-26 Thread Yun Tang
that is a quick fix for his scenario. Best Yun Tang From: Steven Wu Sent: Wednesday, May 27, 2020 0:36 To: Joey Pereira Cc: user@flink.apache.org ; Yun Tang ; Mike Mintz ; Shahid Chohan ; Aaron Levin Subject: Re: RocksDB savepoint recovery performanc

Re: RocksDB savepoint recovery performance improvements

2020-05-26 Thread Steven Wu
Yun, you mentioned that checkpoints also support rescaling. I thought the recommendation [1] is to use savepoints for rescaling. [1] https://www.ververica.com/blog/differences-between-savepoints-and-checkpoints-in-flink On Tue, May 26, 2020 at 6:46 AM Joey Pereira wrote: > Following up: I've put tog

Re: RocksDB savepoint recovery performance improvements

2020-05-26 Thread Joey Pereira
Following up: I've put together the implementation, https://github.com/apache/flink/pull/12345. It's passing tests but is only partially complete, as it still needs some clean-up and configuration. I still need to try running this against a production cluster to check the performance, as well as ge

Re: RocksDB savepoint recovery performance improvements

2020-05-18 Thread Joey Pereira
Thanks Yun for highlighting this, it's very helpful! I'll give it a go with that in mind. We have already begun using checkpoints for recovery. Having these improvements would still be immensely helpful to reduce downtime for savepoint recovery. On Mon, May 18, 2020 at 3:14 PM Yun Tang wrote: >

Re: RocksDB savepoint recovery performance improvements

2020-05-18 Thread Yun Tang
Hi Joey, Previously, I also looked at the mechanism for creating on-disk SSTables, as I planned to use RocksDB's benchmark to mock a scenario in Flink. However, I found the main challenge is how to ensure the keys are inserted in strictly increasing order. The key order in Java could differ from the
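
As an illustration of the ordering concern in the message above, here is a minimal sketch. It assumes one common source of mismatch: Java's byte type is signed, while a bytewise (memcmp-style) comparator, such as RocksDB's default, treats bytes as unsigned. The message is cut off, so this may not be the exact mismatch Yun had in mind. Python is used for brevity, and all function names are hypothetical.

```python
def unsigned_order(keys):
    """Sort keys as a bytewise (memcmp-style) comparator would: bytes as 0..255."""
    # Python compares bytes objects lexicographically using unsigned values.
    return sorted(keys)


def signed_order(keys):
    """Sort keys as if each byte were a signed 8-bit value (-128..127)."""
    def as_signed(key):
        return [b - 256 if b > 127 else b for b in key]
    return sorted(keys, key=as_signed)


def is_strictly_increasing(keys):
    """Check that each key is strictly greater (unsigned bytewise) than the previous one."""
    return all(a < b for a, b in zip(keys, keys[1:]))


# Hypothetical serialized keys; 0x80 is 128 unsigned but -128 signed.
keys = [b"\x01a", b"\x80a", b"\x7fa"]

print(unsigned_order(keys))
print(signed_order(keys))
print(is_strictly_increasing(unsigned_order(keys)))
print(is_strictly_increasing(signed_order(keys)))
```

Under this assumption, keys sorted with a signed comparison would violate the strictly increasing precondition as soon as a byte above 0x7f appears, so any pre-sorting step would need to use the same comparator as the SST writer.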