Hey No it was unrelated. Actually this was confusing: the pipeline wasn’t stuck - it was working, it just didn’t show on Flink’s UI. I guess because it was a bounded PCollection but not sure.
Nir On Thu, 25 Feb 2021 at 22:06 Pablo Estrada <pabl...@google.com> wrote: > hi Nir! > was this fixed by the PR you submitted? > > On Wed, Feb 24, 2021 at 8:55 AM Nir Gazit <nir....@gmail.com> wrote: > >> Hey, >> When trying to read a file from S3 with a combine action, the pipeline >> seems to be stuck. When replacing it with a GCP source it works fine. >> Furthermore - if I comment out the Count.PerElement part it also works. >> >> Anyone has an idea why that is? >> >> lines = p | beam.io.ReadFromText('s3://...') >> transformed = ( >> lines >> | 'SplitLine' >> >> (beam.ParDo(WordExtractingDoFn()).with_output_types(unicode)) >> | 'Count' >> beam.combiners.Count.PerElement() >> | 'Format' >> beam.MapTuple(lambda w, c: f'{w}: {c}') >> ) >> >> transformed | 'Write' >> beam.io.WriteToText('s3://...') >> >> Thanks! >> Nir >> >