Re: Using RocksDB as State Backend over a Distributed File System

Stefan Richter Thu, 26 Apr 2018 05:01:42 -0700

Hi,

I think there is a misunderstanding. RocksDB state backend always operates on 
local disk of the node that runs your task to give you optimal performance. You 
can think of this as a transient working area that does not require any 
durability. Durability always happens through checkpoints (or savepoints) 
which, in turn, go to a distributed storage. Checkpoints and checkpoints are 
like a consistent moment-in-time image of the backends content and can be used 
to recover under failure (checkpoints) or manually resume your job (savepoints).


Best,
Stefan

> Am 26.04.2018 um 13:16 schrieb Chirag Dewan <chirag.dewa...@yahoo.in>:
> 
> Hi,
> 
> I am working on a use case where I need to store a large amount of data in 
> state. I am using RocksDB as my state backend. Now to ensure data 
> replication, I want to store the RocksDB files in some distributed file 
> system.
> 
> From the documentation I can see that Flink recommends a list of FileSystem 
> to be used for state backend. Given here :
> 
> Apache Flink 1.4 Documentation: File Systems 
> <https://ci.apache.org/projects/flink/flink-docs-release-1.4/ops/filesystems.html>
> 
> Apache Flink 1.4 Documentation: File Systems
>  
> <https://ci.apache.org/projects/flink/flink-docs-release-1.4/ops/filesystems.html>
> 
> But I cannot figure out the file system for RocksDB. What are the 
> recommendations for File Systems to be used with  RocksDB? 
> 
> Thanks in advance.
> 
> Regards,
> 
> Chirag
>

Re: Using RocksDB as State Backend over a Distributed File System

Reply via email to