How about Apache Livy? the purpose is similar as SparkLauncher, but through Restful API to launch Spark Jobs
________________________________ From: mhd wrk <mhdwrkoff...@gmail.com> Sent: Monday, April 27, 2020 11:38 PM To: user@spark.apache.org <user@spark.apache.org> Subject: SparkLauncher reliability and scalability [Warning]: This email originated from an external source. Do not open links or attachments unless you know the content is safe. [경고]: 본 이메일은 회사 외부에서 유입되었습니다. 내용이 안전한지 확인하기 전까지는 링크나 첨부파일을 열지 마십시오. We are using SparkLauncher and SparkAppHandle.Listener to launch spark applications from a Java web application and listen to the state changes. Our observation is that as the number of concurrent jobs grow sometimes some of the state changes are not reported (e.g. some applications never report final state even when the corresponding spark job in YARN UI is marked FINISHED). I'm wondering if there are any guidelines/limits on launching (potentially large number of long running), concurrent spark jobs? Thanks,