Re: Bucketing Sink does not complete files, when source is from a collection

2018-04-04 Thread Fabian Hueske
Hi Josh, You are right, FLINK-2646 is related to the problem of non-finialized files. If we could distinguish the cases why close() is called, we could do a proper clean-up if the job terminated because all data was processed. Right now, the source and sink interfaces of the DataStream API are no

Re: Bucketing Sink does not complete files, when source is from a collection

2018-04-04 Thread joshlemer
Actually sorry, I have found that this is most likely a manifestation of https://issues.apache.org/jira/browse/FLINK-2646 as discussed elsewhere on the mailing list. That is, in the second example "fromCollection" the entire stream ends before a checkpoint is made. Let's hope this is fixed some day