Re: [DISCUSS] [BEAM-4126] Deleting Maven build files (pom.xml) grace period?

2018-06-18 Thread Lukasz Cwik
Any updates on BEAM-4512? On Mon, Jun 11, 2018 at 1:42 PM Lukasz Cwik wrote: > Thanks all, it seems as though only Google needs the grace period. I'll > wait for the shorter of BEAM-4512 or two weeks before merging > https://github.com/apache/beam/pull/5571 > > > On Wed, Jun 6, 2018 at 8:29 PM K

Supporting Unbounded Limit in Beam SQL Shell

2018-06-18 Thread Rui Wang
Hi, I want to support "unbounded limit" in Beam SQL shell (jira: https://issues.apache.org/jira/browse/BEAM-4194). Unbound limit means running limit query on unbounded input data (e.g. Apache Kafka, Cloud Pub/Sub). One example query is "SELECT col_name FROM unbounded_table LIMIT 1". This is usefu

Re: [VOTE] Apache Beam, version 2.5.0, release candidate #2

2018-06-18 Thread Boyuan Zhang
Follow up with pervious email( sorry for the inconvenience). Hey JB, If you haven't built python wheels yet, then you can use files in the previous email to move forward. If you have done all of them, then please ignore previous email. Thanks so much for your help. Boyuan On Mon, Jun 18, 2018

Re: [VOTE] Apache Beam, version 2.5.0, release candidate #2

2018-06-18 Thread Boyuan Zhang
Hey JB, I built python wheels and you can get them here: https://storage.googleapis.com/python_wheels/python_wheels.zip. You probably need to sign them and upload them into https://dist.apache.org/repos/dist/dev/beam/2.5.0/ like what we did for 2.4.0: https://dist.apache.org/repos/dist/dev/beam/2.

Re: [PROPOSAL] Merge samza-runner to master

2018-06-18 Thread Xinyu Liu
@Kenn: I am going to add the build.gradle. Is there anything else? @Ahmet, @Robert: here are more details about the samza runner right now: - Missing pieces: timer support in ParDo is not there yet and I plan to add it soon. SplittableParDo is missing but we don't have a use case so far. We are o

Re: [PROPOSAL] Merge samza-runner to master

2018-06-18 Thread Robert Burke
This is exciting! Is it implemented as a portability framework runner too? On Mon, Jun 18, 2018, 4:36 PM Pablo Estrada wrote: > It's very exciting to see a new runner making it into master. : ) > > Best > -P. > > On Mon, Jun 18, 2018 at 3:38 PM Rafael Fernandez > wrote: > >> I've just read this

Re: [VOTE] Apache Beam, version 2.5.0, release candidate #2

2018-06-18 Thread Pablo Estrada
+0.9 I've run the Mobile Gaming examples in Java+Dataflow, and others have documented their tests in the spreadsheet[1]. All seems to work well. We will need the wheel files to be available, and run a few validations on them, but other than that, the release looks good to me. Best -P. [1] https:

Re: [PROPOSAL] Merge samza-runner to master

2018-06-18 Thread Pablo Estrada
It's very exciting to see a new runner making it into master. : ) Best -P. On Mon, Jun 18, 2018 at 3:38 PM Rafael Fernandez wrote: > I've just read this and wanted to share my excitement :D > > > > On Mon, Jun 18, 2018 at 3:10 PM Kenneth Knowles wrote: > >> One thing that will be necessary is

Re: [PROPOSAL] Merge samza-runner to master

2018-06-18 Thread Ahmet Altay
Thank you for everyone who contributed to this runner. It is really great to see this. Xinyu, for the people like myself who were not following the development closely, could you talk about missing pieces, work in progress, future plans? On Mon, Jun 18, 2018 at 3:37 PM, Rafael Fernandez wrote:

Re: [PROPOSAL] Merge samza-runner to master

2018-06-18 Thread Rafael Fernandez
I've just read this and wanted to share my excitement :D On Mon, Jun 18, 2018 at 3:10 PM Kenneth Knowles wrote: > One thing that will be necessary is porting the build to Gradle. > > Kenn > > On Mon, Jun 18, 2018 at 11:57 AM Xinyu Liu wrote: > >> Hi, Folks, >> >> On behalf of the Samza team,

Re: [PROPOSAL] Merge samza-runner to master

2018-06-18 Thread Kenneth Knowles
One thing that will be necessary is porting the build to Gradle. Kenn On Mon, Jun 18, 2018 at 11:57 AM Xinyu Liu wrote: > Hi, Folks, > > On behalf of the Samza team, I would like to propose to merge the > samza-runner branch into master. The branch was created on Jan when we > first introduced

Re: Building and visualizing the Beam SQL graph

2018-06-18 Thread Andrew Pilloud
The name can't just be a constant string as it needs to be unique for every node in the graph. We currently build the name as the rel node class name plus the unique ID from the calcite graph (plus a unique ID from the beam graph, but that goes away with some bug fixes). I've written this up into

Re: CSVSplitter - Splittable DoFn

2018-06-18 Thread Robert Bradshaw
Anecdotal evidence is that most people are reading the csv files line-by-line with TextIO and then parsing into columns in a subsequent DoFn, ignoring (or asserting) that quoted newlines won't occur in their data. On Mon, Jun 18, 2018 at 11:27 AM Austin Bennett wrote: > Hi Beam Users/Dev, > > Ho

[PROPOSAL] Merge samza-runner to master

2018-06-18 Thread Xinyu Liu
Hi, Folks, On behalf of the Samza team, I would like to propose to merge the samza-runner branch into master. The branch was created on Jan when we first introduced the Samza Runner [1], and we've been adding features and refining it afterwards. Now the runner satisfies the criteria outlined in [2

Re: CSVSplitter - Splittable DoFn

2018-06-18 Thread Austin Bennett
Hi Beam Users/Dev, How are people handling currently handling CSVs as input to Beam (or not really doing so)? I see the things listed at the start of this thread -- any others? I have many batch workflows involve getting multi-GB CSV files from third party data aggregators (ex: hourly) and inges

Re: [VOTE] Apache Beam, version 2.5.0, release candidate #2

2018-06-18 Thread Jean-Baptiste Onofré
Hi, I thought I did it, let me double check. Regards JB On 18/06/2018 19:01, Boyuan Zhang wrote: > Hey JB, > > Could you please also build python wheels for RC2 and stage them in > https://dist.apache.org/repos/dist/dev/beam/2.5.0/? There are some > instructions in this PR: https://github.com/a

Re: [VOTE] Apache Beam, version 2.5.0, release candidate #2

2018-06-18 Thread Boyuan Zhang
Hey JB, Could you please also build python wheels for RC2 and stage them in https://dist.apache.org/repos/dist/dev/beam/2.5.0/? There are some instructions in this PR: https://github.com/apache/beam-site/pull/467. Hope it can be helpful. Thanks. Boyuan On Sun, Jun 17, 2018 at 9:22 PM Jean-Baptis

Moving Gradle build logic from build_rules.gradle to buildSrc

2018-06-18 Thread Kenneth Knowles
Hi all, **If you are actively working on build_rules.gradle then I want to coordinate with you** *If you are not interested in Gradle build configs, you can stop reading* I've hit some issues with dependency handling in build_rules.gradle that I think are best resolved by migrating it from being