I was investigating a big topic, trying to find the reason why the retention
limits were not working.
I was able to read messages from 2019 even though the retention was for 5 days.
Until I found the log segment file that contains those 2019 messages,
and checking the rest
of the messages I realized its have messages from Dec 2020. Yep,
messages from the future.

| offset: 4513685442 CreateTime: 1609420521000 keysize: -1 valuesize:
197 sequence: -1 headerKeys: [] payload: 2020-12-31T13:15:21+00:00 ...

Besides fixing how we use the CreateTime for ingesting data, I need to
remove that data.
So my two questions are:

a) How is the strategy kafka uses to decide if a log segment needs to
be deleted. It's using Max CreateTime and compare with retention
limits?

b) How can I delete that whole log segment file, and of course,
without messing the system?


Thanks.-
-- 
Richard Rossel
Atlanta - GA

Reply via email to