Re: Working around lack of SQL triggers

2019-05-31 Thread deklanw
Thanks for the reply. Sorry for my lateness; I put this project on hold for a while, but I'm looking back into it. I think I see what you're saying, but I'm not sure how to actually implement that parameterizable aggregation function. I understand what it would be conceptually, but I'm new to Scala

Re: Limitations in StreamingFileSink BulkFormat

2019-05-31 Thread Timothy Victor
Not an expert, but I would think this will not be trivial, since the reason for using checkpointing to trigger is to guarantee exactly-once semantics in the event of a failure, which in turn is tightly integrated into the CP mechanism. The precursor to the StreamingFileSink was BucketingFileSink which

Limitations in StreamingFileSink BulkFormat

2019-05-31 Thread Ayush Verma
Hello, I am using the StreamingFileSink BulkFormat in my Flink stream processing job to output Parquet files to S3. Now the StreamingFileSink.BulkFormatBuilder

BigQuery source ?

2019-05-31 Thread Niels Basjes
Hi, has anyone created a source to READ from BigQuery into Flink yet (we have Flink running on K8S in the Google Cloud)? I would like to retrieve a DataSet in a distributed way (the data ... it's kinda big) and process that with Flink running on k8s (which we have running already). So far I have