Hi everyone,

We would like to start a discussion thread on "FLIP-108: Add GPU support in Flink" [1].

This FLIP mainly discusses the following issues:

- Enable users to configure the number of GPUs per task executor and forward these requirements to the external resource managers (for Kubernetes/Yarn/Mesos setups).
- Provide information about available GPU resources to operators.

Key changes proposed in the FLIP are as follows:

- Forward GPU resource requirements to Yarn/Kubernetes.
- Introduce GPUManager as one of the task manager services to discover GPU resource information and expose it to the context of functions.
- Introduce a default script for GPU discovery, which provides a privilege mode to help users achieve worker-level isolation in standalone mode.

Please find more details in the FLIP wiki document [1]. Looking forward to your feedback.

[1] https://cwiki.apache.org/confluence/display/FLINK/FLIP-108%3A+Add+GPU+support+in+Flink

Best,
Yangze Guo