Cai Liuyang created FLINK-29577:
-----------------------------------

             Summary: Disable rocksdb wal when restore from full snapshot
                 Key: FLINK-29577
                 URL: https://issues.apache.org/jira/browse/FLINK-29577
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / State Backends
            Reporter: Cai Liuyang


For now, RocksDBFullRestoreOperation and RocksDBHeapTimersFullRestoreOperation 
does's pass RocksDB::WriteOptions to RocksDBWriteBatchWrapper when restore 
kv-data, which will use RocksDBWriteBatchWrapper‘s default WriteOptions(doesn't 
disable rocksdb wal explicitly, see code below), so during restoring from full 
snapshot, wal is enabled(use more disk and affect rocksdb-write-performance 
when restoring)

 
{code:java}
// First: RocksDBHeapTimersFullRestoreOperation::restoreKVStateData() doesn't 
pass WriteOptions to RocksDBWriteBatchWrapper(null as default)

private void restoreKVStateData(
        ThrowingIterator<KeyGroup> keyGroups,
        Map<Integer, ColumnFamilyHandle> columnFamilies,
        Map<Integer, HeapPriorityQueueSnapshotRestoreWrapper<?>> 
restoredPQStates)
        throws IOException, RocksDBException, StateMigrationException {
    // for all key-groups in the current state handle...
    try (RocksDBWriteBatchWrapper writeBatchWrapper =
            new RocksDBWriteBatchWrapper(this.rocksHandle.getDb(), 
writeBatchSize)) {
        HeapPriorityQueueSnapshotRestoreWrapper<HeapPriorityQueueElement> 
restoredPQ = null;
        ColumnFamilyHandle handle = null;
   ......
}


// Second: RocksDBWriteBatchWrapper::flush function doesn't disable wal 
explicitly when user doesn't pass WriteOptions to RocksDBWriteBatchWrapper
public void flush() throws RocksDBException {
    if (options != null) {
        db.write(options, batch);
    } else {
        // use the default WriteOptions, if wasn't provided.
        try (WriteOptions writeOptions = new WriteOptions()) {
            db.write(writeOptions, batch);
        }
    }
    batch.clear();
}

{code}
 

 

As we known, rocksdb's wal is usesless for flink, so i think we can disable wal 
for RocksDBWriteBatchWrapper's default WriteOptions.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to