Thanks for drafting the FLIP and kicking off the discussion, Yangze. Big +1 for this feature. Supporting GPUs in Flink is significant, especially for ML scenarios. I've reviewed the FLIP wiki doc and it looks good to me. I think it's a very good first step for Flink's GPU support.

Thank you~

Xintong Song

On Mon, Mar 2, 2020 at 12:06 PM Yangze Guo <karma...@gmail.com> wrote:

> Hi everyone,
>
> We would like to start a discussion thread on "FLIP-108: Add GPU
> support in Flink"[1].
>
> This FLIP mainly discusses the following issues:
>
> - Enable user to configure how many GPUs in a task executor and
>   forward such requirements to the external resource managers (for
>   Kubernetes/Yarn/Mesos setups).
> - Provide information of available GPU resources to operators.
>
> Key changes proposed in the FLIP are as follows:
>
> - Forward GPU resource requirements to Yarn/Kubernetes.
> - Introduce GPUManager as one of the task manager services to discover
>   and expose GPU resource information to the context of functions.
> - Introduce the default script for GPU discovery, in which we provide
>   the privilege mode to help user to achieve worker-level isolation in
>   standalone mode.
>
> Please find more details in the FLIP wiki document [1]. Looking forward to
> your feedbacks.
>
> [1]
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-108%3A+Add+GPU+support+in+Flink
>
> Best,
> Yangze Guo