Re: [DISCUSS] FLIP-227: Support overdraft buffer

Anton Kalashnikov Wed, 27 Apr 2022 09:28:47 -0700

Hi fanrui,

Thanks for creating the FLIP.

In general, I think the overdraft is good idea and it should help indescribed above cases. Here are my thoughts about configuration:

Please, correct me if I am wrong but as I understand right now we havefollowing calculation.

maxBuffersNumber(per TaskManager) = Network memory(calculated viataskmanager.memory.network.fraction, taskmanager.memory.network.min,taskmanager.memory.network.max and total memory size) /taskmanager.memory.segment-size.

requiredBuffersNumber(per TaskManager) = (exclusive buffers *parallelism + floating buffers) * subtasks number in TaskManager

buffersInUseNumber = real number of buffers which used at currentmoment(always <= requiredBuffersNumber)

Ideally requiredBuffersNumber should be equal to maxBuffersNumber whichallows Flink work predictibly. But if requiredBuffersNumber >maxBuffersNumber sometimes it is also fine(but not good) since not allrequired buffers really mandatory(e.g. it is ok if Flink can notallocate floating buffers)

But if maxBuffersNumber > requiredBuffersNumber, as I understand Flinkjust never use these leftovers buffers(maxBuffersNumber -requiredBuffersNumber). Which I propose to use. ( we can actualy useeven difference 'requiredBuffersNumber - buffersInUseNumber' since ifone TaskManager contains several operators including 'window' which cantemporally borrow buffers from the global pool).

My proposal, more specificaly(it relates only to requesting buffersduring processing single record while switching to unavalability betweenrecords should be the same as we have it now):

* If one more buffer requested but maxBuffersPerChannel reached, thenjust ignore this limitation and allocate this buffers from anyplace(from LocalBufferPool if it has something yet otherwise fromNetworkBufferPool)

* If LocalBufferPool exceeds limit, then temporally allocate it fromNetworkBufferPool while it has something to allocate

Maybe I missed something and this solution won't work, but I like itsince on the one hand, it work from the scratch without anyconfiguration, on the other hand, it can be configuration by changingproportion of maxBuffersNumber and requiredBuffersNumber.

The last thing that I want to say, I don't really want to implement newconfiguration since even now it is not clear how to correctly configurenetwork buffers with existing configuration and I don't want tocomplicate it, especially if it will be possible to resolve the problemautomatically(as described above).



So is my understanding about network memory/buffers correct?

--

Best regards,
Anton Kalashnikov

27.04.2022 07:46, rui fan пишет:

Hi everyone,
Unaligned Checkpoint (FLIP-76 [1]) is a major feature of Flink. Iteffectively solves the problem of checkpoint timeout or slowcheckpoint when backpressure is severe.
We found that UC(Unaligned Checkpoint) does not work well when theback pressure is severe and multiple output buffers are required toprocess a single record. FLINK-14396 [2] also mentioned this issuebefore. So we propose the overdraft buffer to solve it.
I created FLINK-26762[3] and FLIP-227[4] to detail the overdraftbuffer mechanism. After discussing with Anton Kalashnikov, there arestill some points to discuss:
  * There are already a lot of buffer-related configurations. Do we
    need to add a new configuration for the overdraft buffer?
  * Where should the overdraft buffer use memory?
  * If the overdraft-buffer uses the memory remaining in the
    NetworkBufferPool, no new configuration needs to be added.
  * If adding a new configuration:
      o Should we set the overdraft-memory-size at the TM level or the
        Task level?
      o Or set overdraft-buffers to indicate the number of
        memory-segments that can be overdrawn.
      o What is the default value? How to set sensible defaults?
Currently, I implemented a POC [5] and verified it usingflink-benchmarks [6]. The POC sets overdraft-buffers at Task level,and default value is 10. That is: each LocalBufferPool can overdraw upto 10 memory-segments.
Looking forward to your feedback!

Thanks,
fanrui
[1]https://cwiki.apache.org/confluence/display/FLINK/FLIP-76%3A+Unaligned+Checkpoints
[2] https://issues.apache.org/jira/browse/FLINK-14396
[3] https://issues.apache.org/jira/browse/FLINK-26762
[4]https://cwiki.apache.org/confluence/display/FLINK/FLIP-227%3A+Support+overdraft+buffer[5]https://github.com/1996fanrui/flink/commit/c7559d94767de97c24ea8c540878832138c8e8fe
[6] https://github.com/apache/flink-benchmarks/pull/54

Re: [DISCUSS] FLIP-227: Support overdraft buffer

Reply via email to