Github user zhangminglei commented on the issue:

    https://github.com/apache/flink/pull/6075
  
    Hi, Thanks @StephanEwen . For a short checkpoint time that whether lead to 
poor compression or not, I will give an actual production test under 
compression situation and as an attachment attached to 
[FLINK-9407](https://issues.apache.org/jira/browse/FLINK-9407) in the next few 
days.
    
    Actually, We having been using this PR (but not polished yet) in our 
production environment for a long time. And getting a very nice results ending 
that compared to spark streaming. I will put the test results in the form of 
attachments to [FLINK-9407](https://issues.apache.org/jira/browse/FLINK-9407) 
as a reference. And what you are more concerned about the short checkpoint 
interval will lead to poor compression, yea I would agree but we need a test 
for it. But we have all known that the longer snapshots interval, the lower 
performance impact of asynchronous or synchronous snapshotting we would get. 
So, I think people do not inclined to adopt short checkpoint intervals for 
getting a high throughput and low latency in production env. For example, in 
our calculation of uv jobs, the checkpoint intervals is 30 seconds. Anyway, I 
will still give a test results under the compression situation.
    
    I suggest this PR can merge as a temporary solution to reduce the user's 
learning curve since some users already had used ```BucketingSink``` in their 
project and wants this functionality as we can see in the user mail list a few 
days ago.  And In a short time, they may not switch to a new sink 
```StreamingFileSink ``` . This may be one of the good reasons.
    
    By the way, I will watch  
[FLINK-9749](https://issues.apache.org/jira/browse/FLINK-9749) and the subtask  
 [FLINK-9753](https://issues.apache.org/jira/browse/FLINK-9753) soon and give 
more response in the next couple of days.


---

Reply via email to