Hello,

We have question regarding transactional producer states and disk space
usage.

We did a quick and dirty test recently with 3 Simple Java client producers
writing to compressed and topics with compression.type set for both
producers and topic correctly. We performed two rounds of tests (with
Transactional and Non-Transactional Producers).

What we see is that the disk space usage quite high when there are
transactional producers. It's true that once the cleanup happens, some of
these data is expected to be cleaned up and we can reclaim disk space. But
until the files are closed, the disk space usage is quite significant
compared to non-transactional.

For our test, all our producers ran for 24 minutes. For transactional
producers, the kafka log directory apparent-size was 2GB. But for
transactional, it was 60 MB. Our disk space alert was fired based on the
apparent-size. And once we shutdown all our brokers, the directory sizes
changed to 2.2 MB (for txn) and 1.5 MB (for non-txn)

Is this expected for transactional producer-based writes? If so, what is
the recommendation for optimising producers (e.g. any official docs etc.) ?

Regards,

Reply via email to