Hi all,

Dong (cc'ed) and I are opening this thread to discuss our proposal to add an operator attribute that allows an operator to specify whether it supports object reuse [1].

Currently, pipeline.object-reuse defaults to false to avoid data corruption. With this default, records passed between chained operators are deep-copied, which can result in suboptimal performance. We propose adding APIs that operators can use to inform the Flink runtime whether it is safe to reuse the records they emit. This enhancement would enable Flink to maximize its performance under the default configuration.

Please refer to the FLIP document for more details about the proposed design and implementation. We welcome any feedback and opinions on this proposal.

Best regards,
Dong and Xuannan

[1] https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=255073749