Hi Banupriya,

Sometimes an SST file will not be compacted and will stay referenced for a long time. That depends on how RocksDB picks files for compaction. It can happen when some range of keys is never touched for a while, since RocksDB mainly compacts the files or key ranges that grow large. Typically you don't need to worry about this, unless the checkpoint size keeps growing for a long time.
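If you ever do want cold, rarely-touched SST files to be rewritten more aggressively, one option (a rough sketch only, not something you necessarily need here) is a custom RocksDBOptionsFactory that turns on RocksDB's periodic compaction. The class name below is made up, and whether setPeriodicCompactionSeconds is available depends on the RocksDB version bundled with your Flink release:

    import java.util.Collection;
    import org.apache.flink.contrib.streaming.state.RocksDBOptionsFactory;
    import org.rocksdb.ColumnFamilyOptions;
    import org.rocksdb.DBOptions;

    // Hypothetical factory; register it via
    // state.backend.rocksdb.options-factory: com.example.PeriodicCompactionOptionsFactory
    public class PeriodicCompactionOptionsFactory implements RocksDBOptionsFactory {

        @Override
        public DBOptions createDBOptions(
                DBOptions currentOptions, Collection<AutoCloseable> handlesToClose) {
            return currentOptions; // keep Flink's defaults for DB-level options
        }

        @Override
        public ColumnFamilyOptions createColumnOptions(
                ColumnFamilyOptions currentOptions, Collection<AutoCloseable> handlesToClose) {
            // Ask RocksDB to rewrite SST files that have not been compacted for ~1 day,
            // so cold key ranges are eventually picked up as well.
            return currentOptions.setPeriodicCompactionSeconds(24 * 60 * 60);
        }
    }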
Best,
Zakelly

On Fri, Jul 26, 2024 at 2:49 PM banu priya <banuke...@gmail.com> wrote:
> Hi Zakelly,
>
> Thanks a lot for your reply.
>
> I have one more query.
>
> Inside the checkpoint's chk-X directory there is a _metadata file that
> contains a list of the other .sst files. In my chk-17000 directory it
> still refers to the very old 00014.sst (the latest is 225.sst). Why is
> that? Compaction has happened, which is why 00001 to 00013 are no longer
> present, but why hasn't the very old file been compacted yet?
> 1. Do I need to change any other RocksDB property? Or
> 2. Does it mean my source events are still arriving for the same key and
> keep that state alive?
>
> The window fires every 2s, so I don't need the data for long.
>
> Thanks
> Banupriya
>
> On Fri, 26 Jul, 2024, 11:46 am Zakelly Lan, <zakelly....@gmail.com> wrote:
>
>> Hi Banu,
>>
>> I'll try to answer your questions briefly:
>>
>> 1. Yes, when the memtable reaches the value you configured, a flush
>> will be triggered. And no, SST files have a different format from
>> memtables, so their size is smaller than 64mb IIUC.
>>
>> 2. Typically you don't need to change this value. If it is set to 2,
>> then while one write buffer is being flushed to storage, new writes can
>> continue in the other write buffer. Increase this when flushing is too
>> slow.
>>
>> 3. IIUC, a bloom filter helps with point queries, and window processing
>> requires point queries, so enabling it should help.
>>
>> 4. I'd suggest not setting this to 0. This only affects whether the
>> checkpoint data is stored inline in the metadata file. The checkpoint
>> size may differ a little, but it has nothing to do with throughput.
>>
>>
>> Best,
>> Zakelly
>>
>> On Thu, Jul 25, 2024 at 3:25 PM banu priya <banuke...@gmail.com> wrote:
>>
>>> Hi All,
>>>
>>> I have a Flink job with an RMQ source, filters, a tumbling window
>>> (uses processing time, fires every 2s), an aggregator, and an RMQ sink.
>>> I have enabled incremental RocksDB checkpoints every 10s with a minimum
>>> pause between checkpoints of 5s. My checkpoint size keeps increasing,
>>> so I am planning to tune some RocksDB configuration.
>>>
>>> Following are my queries. Can someone help me choose the correct
>>> values?
>>>
>>> 1. state.backend.rocksdb.writebuffer.size = 64 mb:
>>> Does it mean that once the write buffer (memtable) reaches 64 mb it
>>> will be flushed to disk as an .sst file? Will the .sst file also be
>>> 64mb in size?
>>>
>>> 2. state.backend.rocksdb.writebuffer.count = 2:
>>> My job runs with a parallelism of 15 and 3 taskmanagers (so 5 slots
>>> per taskmanager). For a single RocksDB folder, how can I choose the
>>> correct buffer count?
>>>
>>> 3. Do I need to enable the bloom filter?
>>>
>>> 4. state.storage.fs.memory-threshold is 0 in my job. Does it have any
>>> effect on taskmanager throughput or checkpoint size?
>>>
>>> Thanks
>>>
>>> Banu
>>>
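For reference, a minimal sketch of setting the options discussed in this thread programmatically. The keys are the ones mentioned above, the values are only those from this thread (not recommendations), and some key names may differ between Flink versions:

    import org.apache.flink.configuration.Configuration;
    import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

    public class StateBackendConfigSketch {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            // RocksDB state backend with incremental checkpoints, as in the job above.
            conf.setString("state.backend", "rocksdb");
            conf.setString("state.backend.incremental", "true");
            // Write buffer (memtable) settings from questions 1 and 2.
            conf.setString("state.backend.rocksdb.writebuffer.size", "64mb");
            conf.setString("state.backend.rocksdb.writebuffer.count", "2");
            // Bloom filter for point lookups (question 3).
            conf.setString("state.backend.rocksdb.use-bloom-filter", "true");
            // A small non-zero inlining threshold rather than 0 (question 4).
            conf.setString("state.storage.fs.memory-threshold", "20kb");
            // Checkpointing cadence from the original mail.
            conf.setString("execution.checkpointing.interval", "10s");
            conf.setString("execution.checkpointing.min-pause", "5s");

            StreamExecutionEnvironment env =
                    StreamExecutionEnvironment.getExecutionEnvironment(conf);
            // ... build the RMQ source -> window -> aggregator -> RMQ sink pipeline here ...
        }
    }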