Hello, I'm currently using Flink 1.7.2.
I'm trying to run a job that is submitted programmatically using the ClusterClient API:

    public JobSubmissionResult run(PackagedProgram prog, int parallelism)

The job depends on some additional jars, which I pass to the packaged program, along with the job jar file, through the PackagedProgram constructor:

    public PackagedProgram(File jarFile, List<URL> classpaths, String... args)

Normally this works perfectly and the job runs fine. However, if there is an error in the job and it goes into the failing state, then after Flink has been continuously trying to restart it for an hour or so, I notice a NoClassDefFoundError for some classes in the jars that I load using the URL class loader. The job never recovers after that, even once the root cause of the failure has been fixed. (In my case the job has a Kafka source/sink; Kafka was down temporarily and was brought back up afterwards.)

The jar is still available at the path referenced by the URL class loader and has not been tampered with.

Could anyone please give me some pointers as to why this could happen, what I could be missing here, or how I can debug this further?

Thanks,
Subbu