Re: [PROPOSAL] Stop Spark2 support in Spark Runner

2022-04-20 Thread Alexey Romanenko
Hi everyone, A while ago we discussed on dev@ that there are several reasons to stop supporting Spark2 in the Spark Runner (in all of its current variants: RDD, Dataset, Portable) [1]. In short, it adds a burden to Spark Runner support that we would like to av

[ANNOUNCE] Apache Beam 2.38.0 Released

2022-04-20 Thread Daniel Oliveira
The Apache Beam team is pleased to announce the release of version 2.38.0. Apache Beam is an open source unified programming model to define and execute data processing pipelines, including ETL, batch and stream (continuous) processing. See https://beam.apache.org You can download the release her

JdbcIO

2022-04-20 Thread Eric Berryman
Hello, I have a rather simple use case where I would like to read a DB table that acts as a queue (hundreds of millions of events in the initial load, but only thousands of events per day) and write that data out to a sink. This pipeline would be unbounded. I'm looking for reading material and/or cod

[Code Question] Pcollection to List using Java sdk

2022-04-20 Thread Kayal P
Hi All, I am trying to convert a PCollection to a List using the Java SDK. It seems there is a combiners.ToList transform available in the Python SDK. Is there a similar option in the Java SDK? If not, can someone guide me on the right way of doing this? The PCollection is a very small collection, less than