Joey Pereira created FLINK-17971: ------------------------------------ Summary: Speed up RocksDB bulk loading with SST generation and ingestion Key: FLINK-17971 URL: https://issues.apache.org/jira/browse/FLINK-17971 Project: Flink Issue Type: Improvement Components: Runtime / State Backends Reporter: Joey Pereira
RocksDB provides an API for creating SST files and ingesting them directly into RocksDB: [https://github.com/facebook/rocksdb/wiki/Creating-and-Ingesting-SST-files] Using this method for bulk loading data into RocksDB may provide a significant performance increase, specifically for paths doing inserts such as full savepoint recovery and state migrations. This is one method of optimizing bulk loads, as described in https://issues.apache.org/jira/browse/FLINK-17288 This was discussed on the user maillist: [http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/RocksDB-savepoint-recovery-performance-improvements-td35238.html] A draft PR is here: [https://github.com/apache/flink/pull/12345/] -- This message was sent by Atlassian Jira (v8.3.4#803005)