Hi,
Do you use incremental checkpoints?
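If not, enabling them may help a lot, since only the new and compacted
SST files are uploaded on each checkpoint instead of the full state.
Roughly like this (just a sketch, the S3 path is a placeholder; the
constructor comes from the flink-statebackend-rocksdb dependency):

import org.apache.flink.contrib.streaming.state.RocksDBStateBackend;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class EnableIncrementalCheckpoints {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env =
                StreamExecutionEnvironment.getExecutionEnvironment();
        // The second constructor argument enables incremental
        // checkpoints for the RocksDB backend.
        env.setStateBackend(
                new RocksDBStateBackend("s3://my-bucket/checkpoints", true));
        // Equivalent flink-conf.yaml settings:
        //   state.backend: rocksdb
        //   state.backend.incremental: true
    }
}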
RocksDB is a log-structured (append-only) store: updates write new
versions of keys instead of overwriting them in place, so the state size
grows steadily until a compaction runs and the old versions of keys are
garbage-collected.
However, the average state size should stabilize after a while if the
load doesn't change.
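If the growth doesn't level off, you could also try making compaction
more aggressive through a custom OptionsFactory. This is only a sketch
against the current (1.6-era) API, and the right values depend on your
workload:

import org.apache.flink.contrib.streaming.state.OptionsFactory;
import org.apache.flink.contrib.streaming.state.RocksDBStateBackend;
import org.rocksdb.ColumnFamilyOptions;
import org.rocksdb.DBOptions;

RocksDBStateBackend backend =
        new RocksDBStateBackend("s3://my-bucket/checkpoints", true);
backend.setOptions(new OptionsFactory() {
    @Override
    public DBOptions createDBOptions(DBOptions currentOptions) {
        return currentOptions; // keep DB-level defaults
    }

    @Override
    public ColumnFamilyOptions createColumnOptions(ColumnFamilyOptions currentOptions) {
        // Trigger compaction after fewer level-0 files, so old
        // versions of keys are garbage-collected sooner (at the
        // cost of extra compaction I/O).
        return currentOptions.setLevel0FileNumCompactionTrigger(2);
    }
});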
Regards,
Kien
On 10/23/2018 7:03 PM, Sameer W wrote:
Hi,
We are using ValueState to maintain state. It is a pretty simple job: a
keyBy on the stream, followed by a map operator that maintains state in
a ValueState instance. The transaction load is in the billions of
transactions per day, but the state per key is only an 18x6 array of
long values that is constantly updated. We have about 20 million keys,
and transactions are uniformly distributed across those keys.
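For reference, the stateful map is essentially the following
(simplified; Txn stands in for our actual event type):

import org.apache.flink.api.common.functions.RichMapFunction;
import org.apache.flink.api.common.state.ValueState;
import org.apache.flink.api.common.state.ValueStateDescriptor;
import org.apache.flink.configuration.Configuration;

public class TxnStateMap extends RichMapFunction<Txn, Txn> {
    private transient ValueState<long[][]> matrix;

    @Override
    public void open(Configuration parameters) {
        matrix = getRuntimeContext().getState(
                new ValueStateDescriptor<>("txn-matrix", long[][].class));
    }

    @Override
    public Txn map(Txn txn) throws Exception {
        long[][] m = matrix.value();
        if (m == null) {
            m = new long[18][6]; // 18x6 longs per key
        }
        // ... update the counters from the transaction ...
        matrix.update(m); // writes a new value for this key into RocksDB
        return txn;
    }
}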
When the job starts, the size of the checkpoints (using RocksDB backed
by S3) is low, on the order of 500 MB. However, after 12 hours of
operation the checkpoint size has grown to about 4-5 GB. The time taken
to complete a checkpoint starts at around 15-20 seconds and reaches
about a minute after 12 hours.
What is the reason behind the increasing size of checkpoints?
Thanks,
Sameer