[Questions] KafkaIO SplittableDoFn offset managment

2022-06-06 Thread Jean Wisser
criptor.of(). But since - if I understand correctly - a single (topic,partition) can be split into multiple workers, how can we make sure that we always commit offsets in the correct order ? Thanks a lot for your help, Jean. Jean Wisser Data Engineer [https://www.flowtraders.com/sites/flowtr

Re: [EXTERNAL] Re: [Questions] KafkaIO SplittableDoFn offset managment

2022-06-06 Thread Jean Wisser
Hi John,Thanks for your Jean Wisser Data Engineer [https://www.flowtraders.com/sites/flowtraders-corp/files/images/flow-traders-logo-mv.png] Flow Traders B.V. T: +31 20 799 6497 F: +31 20 799 6780 Jacob Bontiusplaats 9 1018 LL Amsterdam Nederland www.flowtraders.com<http://www.flowtraders.

Re: [EXTERNAL] Re: [Questions] KafkaIO SplittableDoFn offset managment

2022-06-06 Thread Jean Wisser
committed. I guess the reason for that is because in ReadFromKafkaDoFn in theinitialRestriction(), the consumer has still a different name (prefixed with "initialOffset"). Am I doing something wrong ? Thanks, Jean. Jean Wisser Data Engineer [https://www.flowtraders.com/sites/flowtraders-

Re: [EXTERNAL] Re: [Questions] KafkaIO SplittableDoFn offset managment

2022-06-07 Thread Jean Wisser
at happened? Using KafkaIO with ReadFromKafkaDoFn.java and commitOffsetsFinalize should commit offsets of processed messages and if the pipeline is restarted, should resume from the last committed offset. While committing the offset wo... Jean Wisser Data Engineer [https://www.flowtrader

Re: [EXTERNAL] Re: [Questions] KafkaIO SplittableDoFn offset managment

2022-06-08 Thread Jean Wisser
s [100, 150), the next restriction [150, 200) can be scheduled on a different worker ? Thanks for your help. Jean. Jean Wisser Data Engineer [https://www.flowtraders.com/sites/flowtraders-corp/files/images/flow-traders-logo-mv.png] Flow Traders B.V. T: +31 20 799 6497 F: +31 20 799

Can we use KafkaIO SplittableDoFn ?

2022-07-20 Thread Jean Wisser
ay of reading kafka messages with beam ? Thanks, Jean Wisser. Jean Wisser Data Engineer [https://www.flowtraders.com/sites/flowtraders-corp/files/images/flow-traders-logo-mv.png] Flow Traders B.V. T: +31 20 799 6497 F: +31 20 799 6780 Jacob Bontiusplaats 9 1018 LL Amsterdam Nede

FileIO continuously reading lots of new files

2022-10-10 Thread Jean Wisser
all files that were matched in the matchAll() operation. Any ideas on how to solve that ? Thanks, Jean Wisser. Jean Wisser Data Engineer [https://www.flowtraders.com/sites/flowtraders-corp/files/images/flow-traders-logo-mv.png] Flow Traders B.V. T: +31 20 799 6497 F: +31 20 799 6780 Jaco

PubsubIO to AvroIO huge fanout

2023-03-23 Thread Jean Wisser
reshuffle step right after parseFilesGenericRecords, but without any success. Any ideas on how to resolve that ? Thanks, Jean Wisser. Jean Wisser Data Engineer [https://www.flowtraders.com/sites/flowtraders-corp/files/images/flow-traders-logo-mv.png] Flow Traders B.V. T: +31 20 799 6497