Re: Samza compaction policy

2019-11-07 Thread Malcolm McFarland
Actually, do you have an example of some appropriate settings for Kafka to ensure that compaction is behaving correctly for the Samza checkpoint topics? Cheers, Malcolm McFarland Cavulus

Re: Samza compaction policy

2019-11-07 Thread Malcolm McFarland
Hi Prateek, As far as I can gather, we are indeed seeing duplicate keys in the checkpoint topics. We definitely have cleanup.policy=compact on all of these topics. When you suggest that I check the topic partition size graph, is this just the checkpoint topics, or any that have cleanup.policy=comp

Re: Samza compaction policy

2019-11-06 Thread Prateek Maheshwari
Hi Malcolm, Using cleanup.policy=compact on the Kafka checkpoint topic should be sufficient, and is the default when the topic is created by Samza. Under normal operations, a checkpoint topic should hold only about as many messages as there are tasks. I can suggest the following ways to identify the issue: 1. Read th
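
For readers following the thread, here is a minimal sketch of what "roughly one message per task" means for a compacted topic. It does not talk to Kafka or Samza. The records are a plain in-memory list of (key, value) pairs, and the key names ("task-0", "task-1"), the offset strings, and both function names are hypothetical, chosen only for illustration. It keeps the latest value per key, which is the state compaction converges towards, and reports keys that still appear more than once.

```python
from collections import Counter


def compact(records):
    """Keep only the most recent value for each key (illustrative only)."""
    latest = {}
    for key, value in records:
        # Remove any earlier entry so the key moves to its latest position.
        latest.pop(key, None)
        latest[key] = value
    return list(latest.items())


def duplicate_keys(records):
    """Return {key: count} for every key that appears more than once."""
    counts = Counter(key for key, _ in records)
    return {key: n for key, n in counts.items() if n > 1}


# Hypothetical checkpoint-like records: (task name, checkpointed offset).
records = [
    ("task-0", "offset-10"),
    ("task-1", "offset-7"),
    ("task-0", "offset-25"),
]

print(duplicate_keys(records))  # keys written more than once
print(compact(records))         # one entry per key after compaction
```

Note that Kafka's log cleaner does not compact the active segment, so seeing a few duplicate keys among the most recent messages of a compacted topic is expected and not by itself a sign of misconfiguration.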