Re: Flink missing Kafka records

2021-04-29 Thread Dan Hill
Hey Arvid, I'll try to repo sometime in the next few weeks. I need to make some larger changes to get a full diff to see what is being dropped. On Thu, Apr 29, 2021 at 4:03 AM Arvid Heise wrote: > Hi Dan, > > could you check which records are missing? I'm suspecting it could be > records that

Re: Flink missing Kafka records

2021-04-29 Thread Arvid Heise
Hi Dan, could you check which records are missing? I'm suspecting it could be records that are emitted right before roll over of the bucket strategy from an otherwise idling partition. If so it could be indeed connected to idleness. Idleness tells Flink to not wait on the particular partition to

Re: Flink missing Kafka records

2021-04-26 Thread Dan Hill
Hey Robert. Nothing weird. I was trying to find recent records (not the latest). No savepoints (just was running about ~1 day). No checkpoint issues (all successes). I don't know how many are missing. I removed the withIdleness. The other parts are very basic. The text logs look pretty usele

Re: Flink missing Kafka records

2021-04-26 Thread Robert Metzger
Hi Dan, Can you describe under which conditions you are missing records (after a machine failure, after a Kafka failure, after taking and restoring from a savepoint, ...). Are many records missing? Are "the first records" or the "latest records" missing? Any individual records missing, or larger b