Hi Sidhartha,

This is a general limitation at the moment: Flink does not keep a separate counter for each bucket, only a single global one. Flink assumes that the sink can write to any bucket at any time, so the counter is never reset. Otherwise a new part file numbered 0 could overwrite a previously written file with the same number.
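To illustrate, here is a minimal sketch in plain Java of the behaviour described above. It is a simplified model and not Flink's actual code: the class name PartCounterSketch and the method rollNewPartFile are hypothetical, made up for this example. The only things taken from this thread are the "part-<subtask_number>-<count>" naming and the fact that one counter is shared by all buckets.

```java
// Simplified model of one sink subtask. NOT Flink's implementation;
// all names here are hypothetical and exist only for illustration.
final class PartCounterSketch {
    private final int subtaskIndex;
    // Assumption taken from the thread: a single counter shared by every bucket.
    private long partCounter = 0;

    PartCounterSketch(int subtaskIndex) {
        this.subtaskIndex = subtaskIndex;
    }

    // Called whenever a new part file is opened in the given bucket.
    String rollNewPartFile(String bucketId) {
        String path = bucketId + "/part-" + subtaskIndex + "-" + partCounter;
        partCounter++; // never reset, whichever bucket is being written
        return path;
    }

    public static void main(String[] args) {
        PartCounterSketch sink = new PartCounterSketch(0);
        System.out.println(sink.rollNewPartFile("2019-07-29")); // 2019-07-29/part-0-0
        System.out.println(sink.rollNewPartFile("2019-07-29")); // 2019-07-29/part-0-1
        System.out.println(sink.rollNewPartFile("2019-07-30")); // 2019-07-30/part-0-2
        // A late record for the previous day still gets a fresh number.
        System.out.println(sink.rollNewPartFile("2019-07-29")); // 2019-07-29/part-0-3
    }
}
```

If the counter were reset to 0 when the day changed, the late record in the last call would be written to 2019-07-29/part-0-0 again and would clash with the first file.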
Best,
Andrey

On Tue, Jul 30, 2019 at 7:01 AM Haibo Sun <sunhaib...@163.com> wrote:

> Hi Sidhartha,
>
> Currently, the part counter is never reset to 0, nor is it possible to
> customize the part filename. So I don't think there's any way to reset it
> right now. I guess the reason it can't be reset to 0 is the concern that
> previous parts would be overwritten. Although the bucket id is part of the
> part file path, StreamingFileSink does not know when the bucket id will
> change in the case of a custom BucketAssigner.
>
> Best,
> Haibo
>
> At 2019-07-30 06:13:54, "sidhartha saurav" <sidsau...@gmail.com> wrote:
>
> Hi,
>
> We are using StreamingFileSink with a custom BucketAssigner and
> DefaultRollingPolicy. The custom BucketAssigner is simply a date bucket
> assigner. The StreamingFileSink creates part files with the name
> "part-<subtask_number>-<count_of_the_bucket_created_by_that_subtask>". The
> count is an integer and is incremented on each rollover. Now my questions
> are:
>
> 1. When does this count reset to 0?
> 2. Is there a way I can reset this count programmatically? Since we are
> using a day bucket, we would like the count to reset every day.
>
> We are using Flink 1.8.
>
> Thanks
> Sidhartha