Re: KafkaIO Read Latency

2020-06-11 Thread Alexey Romanenko
Thank you for details. I expect that StartBundle and FinishBundle methods are called in your DoFn (KafkaMessageExtractor). Since it’s called before your Window transform, I don’t think it should affect it. Did you count how many records are processed by one bundle? What is actually happening

Re: KafkaIO Read Latency

2020-06-10 Thread Talat Uyarer
Hi, I check the time when StartBundle is called and do the same thing for FinishBundle then take the difference between Start and Finish Bundle times and report bundle latency. I put this metric on a step(KafkaMessageExtractor) which is right after the KafkaIO step. I dont know if this is related,

Re: KafkaIO Read Latency

2020-06-10 Thread Alexey Romanenko
Hi Talat, Could you elaborate what do you mean by “opening and closing bundle”? Sometimes, starting a KafkaReader can take time since it will seek for a start offset for each assigned partition but it happens only once at pipeline start-up and mostly depends on network conditions. > On 9 Jun 2